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(57) Abstract 



The invention relates to a bispecinc or bivalent antibody fragment analogue comprising a binding complex containing two polypeptide 
chains, whereby one polypeptide chain comprises two heavy chain variable domains (Vh) in series and die other polypeptide chain comprises 
two light chain variable domains (Vi) in series, the binding complex further containing two pairs of variable domains (Vh-A//Vl-A and 
Vh-B//Vl-B). The two Vh*s and/or the two V L *s are connected directly or via an intermediate peptide linker. Also a production method 
for such antibody fragment analogues is disclosed. 
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Title: A bi functional or bivalent antibody fragment 
analogue 

The invention relates to new bispecif ic or bivalent 
5 antibody fragment analogues, a process for preparing such 
antibody fragment analogues and various uses of such 
antibody fragment analogues. 

Background of the invention and prior art 

10 1, AntifrQfly structure 

Antibody molecules typically are Y-shaped molecules whose 
basic unit consist of four polypeptides, two identical 
heavy chains and two identical light chains, which are 
covalently linked together by disulfide bonds. Each of 

15 these chains is folded in discrete domains. The C- terminal 
regions of both heavy and light chains are conserved in 
sequence and are called the constant regions, also known as 
C- domains . The N- terminal regions, also known as V- domains, 
are variable in sequence and are responsible for the 

20 antibody specificity. The antibody specifically recognizes 
and binds to an antigen mainly through six short 
complementarity- determining regions located in their V- 
domains (see Figure 1) * 



25 



30 



35 



In this specification abbreviations are used having the 
following meaning. 



C- domain 

V- domain 

V L 

V H 

Fv 

scFv 

CDR 

ELISA 

PCR 



Constant domain 
Variable domain 

Variable domain of the light chain 

Variable domain of the heavy chain 

dual chain antibody fragment containing both a 

V H and a V L 

single- chain Fv (V H and V L genetically linked 
either directly or via a peptide linker) 
Complementarity- determining region 
Enzyme Linked Immuno Sorbent Assay 
Polymerase Chain Reaction 
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TMB 



PBST 



PBS 



IPTG 



IsoPropyl -S-ThioGalactopyranoside 
Phosphate Buffered Saline 

Phosphate Buffered Saline with 0.15% Tween 
3 , 3 ' , 5 , 5 ' -TetraMethylBenzidine 



It is generally known that proteolytic digestion of an 
antibody with papain yields three fragments. The fragment 
containing the CH2 and CK, domains of the two heavy chains 
connected by the complete hinge (see Figure 1) crystallises 
very easily and was therefore called Fc fragment. The two 
other fragments are identical and were called Fab 
fragments, as they contained the antigen -binding site. 
Digestion with pepsin is such that the two Fab's remain 
connected via the hinge, forming only two fragments: Fc' 
and Faiv 

The Fv is the smallest unit of an antibody which still 
contains the complete binding site (see Figure 1) and full 
antigen binding activity. It consists of only the V-dornains 
of the heavy and light chains thus forming a small, 
heterodimeric variable fragment or Fv. Fv's have a 
molecular weight of about 25 kD, which is only one sixth of 
the parent whole antibody (in the case of an IgG) . 
Previously Fv' s were only available by proteolysis in a 
select number of cases (Givol, 1991). The production of 
Fv's can now be achieved more routinely using genetic 
engineering methods through cloning and expressing DNA 
encoding only the V- domains of the antibody of interest. 
Smaller fragments, such as individual V- domains (Domain 
Antibodies or dABs, Ward et al., 1989), and even individual 
CDR's (Williams et al., 1989; Taub et al. # 1989) were shown 
to retain the binding characteristics of the parent 
antibody. However, this is not achievable on a routine 
basis: most naturally occurring antibodies need both a V H 
and a V L to retain full immunoreactivity . For example, in 
the case of V H D1.3 (Ward et al., 1989), although it still 
binds hen egg lysozyme (HEL) with an affinity close to that 
of the parent antibody, it was shown that loss of 
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specificity was observed in that it can no longer 
distinguish turkey lysozyme from HEL, whereas the Pv can 
(Berry and Davies, 1992) . Although murine dABs can be 
obtained more routinely from spleen libraries (Ward et al., 
5 1989) , the approach is unsustainable because of the many 
problems associated with their production and physical 
behaviour: expression is extremely poor, affinity tends to 
be low, stability and solubility in water is low, and non- 
specific binding is usually very high. According to the 

10 literature a possible explanation of these undesirable 

characteristics is the exposure of the hydrophobic residues 
which are normally buried in the V ri -V L interface. The 
exposed hydrophobic patches are thought to contribute to 
aggregation of the protein inside the cells and/or in the 

15 culture medium, leading to poor expression and/or poor 

solubility (Anthony et al., 1992; Ward et al., 1989). The 
hydrophobic patches can also explain the high non-specific 
binding described by Berry and Davies, 1992. These problems 
clearly limit the usefulness of these molecules. 

20 Most of the Camelid antibodies appear to be an exception to 
this rule in that they only need one V- domain, namely V H , 
to specifically and effectively bind an antigen (Harriers - 
Castermans et al., 1993). In addition, preliminary data 
indicate that they seem not to suffer from the 

25 disadvantages of mouse dABs, as these camelid antibodies or 
fragments thereof are soluble and have been shown to 
express well in yeast and Aspergillus moulds. These 
observations can have important consequences for the 
production and exploitation of antibody -based products, see 

30 patent application WO 94/25591 (UNILEVER et al., first 
priority date 29.04.93). 

2. Product ion of antibody fragment? 

Several microbial expression systems have already been 
35 developed for producing active antibody fragments, e.g. the 
production of Fab in various hosts, such as B. coli (Better 
et al., 1988, Skerra and Pluckthun, 1988, Carter et al.. 
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1992), yeast (Horwitz et al., 1988), and the filamentous 
fungus Trichoderma reesei (Nyyssonen et ai., 1993) has been 
described. The recombinant protein yields in these 
alternative systems can be relatively high (1-2 g/1 for Fab 
5 secreted to the periplasmic space of E. coli in high cell 
density fermentation, see Carter et al., 1992), or at a 
lower level, e.g. about 0.1 mg/1 for Fab in yeast in 
fermenters (Horwitz et al., 1988), and 150 mg/i for a 
fusion protein CBHI-Fab and 1 mg/1 for Fab in Trichoderma 

10 in fermenters (Nyyssonen et al., 1993) and such production 
is very cheap compared to whole antibody production in 
mammalian cells (hybridoma, myeloma, CHO) . Although the 
latter can give yields of the order of 1 g/1 in high cell 
density fermentation, it is a time-consuming and very 

15 expensive manufacturing method resulting in a cost price of 
about 1000 £/gram of antibody. It was further demonstrated 
that plants can be used as hosts for the production of both 
whole antibodies (Hiatt et al., 1989} and scFv's (Owen et 
al., 1992, Firek et al., 1993), whereby yields of upto 0.5% 

20 of the total soluble protein content in tobacco leaves were 
mentioned. 

The fragments can be produced as Fab's or as Fv's, but 
additionally it has been shown that a V H and a V L can be 
genetically linked in either order by a flexible 
25 polypeptide linker, which combination is known as an scFv 
(Bird et al. (1988), Huston et al. (1988), and granted 
patent EP-B-0281604 (GENEX/BNZON LABS INC.; first priority 
date 02-09-1986) . 

30 3. Bivalent and bispecific antibodies and antibody 

fragments 

The antibody fragments Fab, Fv and scFv differ from whole 
antibodies in that the antibody fragments carry only a 
single antigen-binding site. Recombinant fragments with two 

35 binding sites have been made in several ways, for example, 
by chemical cross -linking of cysteine residues introduced 
at the C-terminus of the V H of an Fv (Cumber et al., 1992), 
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or at the C- terminus of the V L of an scFv (Pack and 
Pluckthun, 1992) , or through the hinge cysteine residues of 
Fab's (Carter et al. # 1992). Another approach to produce 
bivalent antibody fragments is described by Kostelny et al. 
5 (1992) and Pack and Pluckthun (1992) and is based on the 
inclusion of a C- terminal peptide that promotes 
dimerizat ion . 

When two different specificities are desired, one can 
generate bispecific antibody fragments. The traditional 

10 approach to generate bispecific whole antibodies was to 
fuse two hybridoma cell lines each producing an antibody 
having the desired specificity • Because of the random 
association of immunoglobulin heavy and light chains, these 
hybrid hybridomas produce a mixture of up to 10 different 

15 heavy and light chain combinations, only one of which is 
the bispecific antibody (Milstein and Cuello, 1983) . 
Therefore, these bispecific antibodies have to be purified 
with cumbersome procedures, which considerably decrease the 
yield of the desired product. 

20 Alternative approaches include in-vitro linking of two 

antigen specificities by chemical cross- linking of cysteine 
residues either in the hinge or via a genetically 
introduced C- terminal Cys as described above . An 
improvement of such in vitro assembly was achieved by using 

25 recombinant fusions of Fab's with peptides that promote 

formation of heterodimers (Kostelny et al., 1992). However, 
the yield of bispecific product in these methods is far 
less than 100%. 

A more efficient approach to produce bivalent or bispecific 
30 antibody fragments, not involving in vitro chemical 

assembly steps, was described by Holliger et al. (1993). . 
This approach takes advantage of the observation that 
scFv's secreted from bacteria are often present as both 
monomers and dimers. This observation suggested that the V H 
35 and V L of different chains can pair, thus forming dimers and 
larger complexes. The dimeric antibody fragments, also 
named "diabodies" by Hollinger et al., in fact are small 
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bivalent antibody fragments that assembled in vivo. By 
linking the V H and V L of two different antibodies 1 and 2, 
to form "cross -over" chains V H 1V L 2 and V H 2-V L 1 (see Figure 
2B) , the dimerisation process was shown to reassemble both 
5 antigen-binding sites. The affinity of the two binding 

sites was shown to be equal to the starting scFv's, or even 
to be 10- fold increased when the polypeptide linker 
covalently linking V H and V L was removed, thus generating 
two proteins each consisting of a v H directly and 

10 covalently linked to a V L not pairing with the V H (see 

Figure 2C) . This strategy of producing bispecif ic antibody 
fragments was also described in several patent 
applications. Patent application WO 94/09131 (SCOTGEN LTD; 
priority date 15.10.92) relates to a bispecif ic binding 

15 protein in which the binding domains are derived from both 
a V H and a V L region either present at two chains or linked 
in an scFv, whereas other fused antibody domains, e.g. C- 
terminal constant domains, are used to stabilise the 
dimeric constructs. Patent application WO 94/13804 

20 (CAMBRIDGE ANTIBODY TECHNOLOGY / MEDICAL RESEARCH COUNCIL; 
first priority date 04.12.92) relates to a polypeptide 
containing a V H and a V L which are incapable of associating 
with each other, whereby the V- domains can be connected 
with or without a linker. 

25 Mallender and Voss, 1994 (also described in patent 

application WO 94/13806; DOW CHEMICAL CO; priority date 
11.12.92) reported the in vivo production of a single- chain 
bispecif ic antibody fragment in E. coli. The bispecif icity 
of the bivalent protein was based on two previously 

30 produced monovalent scFv molecules possessing distinct 

specificities, being linked together at the genetic level 
by a flexible polypeptide linker. The thus formed V H l- 
1 inker - V L l-l inker -V H 2-1 inker -V L 2 fragment (see Figure 2A) 
was shown to contain both antigen binding specificities l 

35 and 2. (1= anti- fluorescein, 2= ant i- single -stranded DNA) . 
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Traditionally, whenever single- chain antibody fragments are 
referred to, a single molecule consisting of one heavy 
chain linked to one (corresponding) light chain in the 
presence or absence of a polypeptide linker is implicated. 
5 When making bivalent or bispecif ic antibody fragments 

through the 'diabody' approach (Holliger et al., (1993) and 
patent application WO 94/09131) or by the 'double scFv' 
approach (Mallender and Voss, 1994 and patent application 
WO 94/13806), again the V H is linked to a (the 

10 corresponding) V L . 

It is realised that claims 32 and 33 of patent application 
WO 93/11161 (ENZON INC.; priority date 25.11.91) and the 
corresponding passages in that specification on page 22, 
lines 1-10 may read on a polypeptide comprising two V L 's 

15 fused together via a flexible polypeptide linker, and on a 
polypeptide comprising two V H 's fused together via a 
flexible polypeptide linker, respectively. However, no 
examples were given to substantiate this approach, thus it 
was in fact a hypothetical possibility instead of an 

20 actually produced compound. 

A skilled person would not have expected that such approach 
would be viable for at least three reasons. Firstly, it is 
widely recognised that immunoglobulin heavy chains 
(excluding the above described camel immunoglobulins) have 

25 very limited solubility and spontaneously precipitate out 
of aqueous solution when isolated from their light chain 
partners. Secondly, several groups have shown (Ward et al., 
1989, Berry and Davies, 1992, and Anthony et al., 1992) 
that expression of V H 's in the absence of V L 's is hampered 

30 by extremely poor yields of unstable product with many 

undesirable properties, e.g. non-specific binding. Thirdly 
in patent application WO 94/13804 it was described on page 
31 lines 10-12, that in computer modelling experiments they 
could not model as heterodimers V H -V H and V L -V L given the 

35 constraints of short linkers. 

Thus the simple suggestion given in patent application WO 
93/11161 is not an enabling disclosure leading a skilled 
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person to try with a reasonable expectation of success 
whether such suggestion would work; therefore, that patent 
application should not be considered as relevant prior art 
for the present invention. 

5 

gwtmiay Y of the invention 
The present invention provides a bispecific or bivalent 
antibody fragment analogue, which comprises a binding 
complex containing two polypeptide chains, one of which 
10 comprises two times a variable domain of a heavy chain (V H ) 
in series and the other comprises two times a variable 
domain of a light chain (V L ) in series. 

In one aspect of the invention one chain of the antibody 
fragment analogue comprises a first V H (V H -A) connected to a 

15 second V H (V H -B) and the other chain comprises a first V L 
(V L -A) connected to a second V L (V L -B) . In a preferred 
embodiment of this aspect one chain comprises a first V H 
(V H -A) followed by a second V H (V H -B) , thus [V H -A * V H -B] , 
and the other chain comprises a first V L (V L -A) preceded by 

20 a second V L (V L -B) , thus [V L -B * V L -A] . For some embodiments 
of this aspect the two V H 's are directly connected to each 
other, but for other embodiments of this aspect of the 
invention the two V L 's are directly connected to each other.. 
According to another embodiment of this aspect of the 

25 invention the two V H 's are connected to each other by a 

linker and also the two V L 's are connected to each other by 
a linker. Such a linker usually comprises at least one 
amino acid residue. 

According to a special embodiment of this aspect of the 
30 invention one chain comprises a first V„ (V H -A) followed by 
a second V H (V H -B) , thus [V H -A * V H -B] , and the other chain 
comprises a first V L (V L -A) followed by a second V L (V L -B) , 
thus IV L -A * V L -B] , and in which the two V H 's are connected 
to each other by a linker and also the two V L 's are 
35 connected to each other by a linker, whereas each linker 
comprises at least 10 amino acid residues. 
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According to the above aspect of the invention with A being 
different from B there are provided bispecific antibody 
fragment analogues. 

According to another aspect of the invention the 
5 specificities A and B are the same resulting in a bivalent 
antibody fragment. 

According to a further aspect of the invention the 
bispecific or bivalent antibody fragment analogues can be 
used in a diagnostic technique or for immunoassays, in a 
10 purification method, for therapy, or in other methods in 
which immunoglobulins or fragments thereof are used. Such 
uses are well-known in the art. 

The invention also provides a process for producing the 
antibody fragments of the invention in that a host is 
15 transformed by incorporating into that host a DNA encoding 
the two V H 's with or without a connecting linker and a DNA 
encoding the two V L 's with or without a connecting linker. 
Preferably the two DNA's are placed in a dicistronic 
arrangement . 

20 It is also possible that the two linked V H 's and the two 
linked V L 's are produced separately by different hosts, 
after which the linked V H 's produced by one host can be 
combined with the linked V L 's produced by the other host. 
The hosts can be selected from the group consisting of 

25 prokaryotic bacteria of which examples are Gram- negative 
bacteria, e.g. E. coll, and Gram-positive bacteria, e.g. B. 
subtilis or lactic acid bacteria, lower eukaryotes examples 
of which are yeasts, e.g. belonging to the genera 
Saccharomyces, Kl uyveromyces , or Trichodexma, moulds, e.g. . 

30 belonging to the genera Aspergillus and Neurospora, and 
higher eukaryotes, exanqples of which are plants, e.g. 
tobacco, and animal cells, examples of which are myeloma 
cells and CHO cells. The techniques to transform a host by 
genetic engineering methods in order to have a desirable 

35 polypeptide produced by such host are well-known to persons 
skilled in the art as is evident from the literature 
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mentioned above under the heading "Background of the 
invention and prior art". 



5 Figure 1 
Figure 2 



10 



Figure 3 



15 



Figure 4 



20 Figure 5 



25 Figure 6 



30 Figure 7 



35 Figure 8 



Brief description of the drawings 

depicts in schematic form the structure of a 
typical antibody (immunoglobulin) molecule, 
shows a schematic representation of published 
arrangements of heavy and light chain V- domain 
gene fragments that have been proven to 
produce bispecific antibody fragments, 
shows in diagrammatic form the suggested 
arrangement of the V- domains of a double head 
antibody fragment according to the invention 
with the V-domains in the following order: 
V„A-V H B + V L B-V L A. 

shows the nucleotide sequence of the EcoRI- 
Hindi I I insert of pUR.4124 containing DNA (see 
SEQ ID NO: 1) encoding V L Lys- Linker -V H Lys (see 
SEQ ID NO: 2) . 

shows the nucleotide sequence of the Hindlll- 
EcoRl insert of plasmid Fv.3418 (see SEQ ID 
NO: 3) containing DNA encoding pelB leader- 
V H 3418 (see SEQ ID NO: 4) and DNA encoding 
pelB leader- V L 3418 (see SEQ ID NO: 5) . 
shows the nucleotide sequence of the Hindlll- 
EcoRl insert of plasmid Fv.4715-myc (see SEQ 
ID NO: 6) containing DNA encoding pelB leader - 
V H 4715 (see SEQ ID NO: 7) and DNA encoding 
pelB leader- V L 4715-Myc tag (see SEQ ID NO: 8) . . 
shows the nucleotide sequence of the Hindlll- 
EcoRI insert of scFv.4715-myc containing DNA 
(see SEQ ID NO: 9) encoding pelB leader- 
V H 4715-Linker-V L 4715-Myc tag (see SEQ ID NO: 
10) . 

a/b shows the nucleotide sequence of the Hindlll- 
ScoRI insert of pGOSA.E (see SEQ ID NO: 11) 
containing DNA encoding pelB leader- V H 4715- 
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20 



25 



Figure 9 



Figure 10 



15 

Figure 11 
Figure 12 

Figure 13 

Figure 14 

Figure 15 

30 Figure 16 



35 Figure 17 



Linker- V L 3418 (see SEQ ID NO: 12) and DNA 
encoding pelB leader- V L 3418- Linker- V H 4715 (see 
SEQ ID NO: 13) . 

gives an overview of the oligonucleotides and 
their positions in pGOSA.B that can be used to 
replace V- domain gene fragments, 
illustrates the amino acid sequence of the V H - 
V H and V L -V L do main ju nctions in fusion 
polypepti des GOSA.B (see amm o acids 114-145 
in SEQ ID NO: 12 and amino acids 102-128 in 
SEQ ID NO: 13), GOSA.V (see SEQ ID NO: 30 and 
amino acids 102-128 in SEQ ID NO: 13) , GOSA.S 
(see amino acids 114-145 in SEQ ID NO: 12 and 
SEQ ID NO: 31) and GOSA.T (see SEQ ID NO: 30 
and SEQ ID NO: 31) . 

shows the specificity of Streptococcus binding 
of scFv.4715-myc. 

shows the specificity of glucose oxidase 

targeting onto the surface of various 

Streptococcus strains by GOSA.E. 

shows the specificity of glucose oxidase 

targeting onto the surface of various 

Streptococcus strains by GOSA.V. 

shows the specificity of glucose oxidase 

targeting onto the surface of various 

Streptococcus strains by GOSA.S. 

shows the specificity of glucose oxidase 

targeting onto the surface of various 

Streptococcus strains by GOSA.T. 

shows the results of an ELISA . Individual 

fractions of a gelf iltration experiment using 

partially purified GOSA.E as feedstock were 

tested for glucose oxidase and Streptococcus 

sanguis bispecific binding activity. 

shows the results of an ELISA. Individual 

fractions of a gelf iltration experiment using 

partially purified GOSA.V as feedstock were 
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tested for glucose oxidase and Streptococcus 
sanguis bispecific binding activity. 
Figure 18 shows the results of an ELISA. Individual 

fractions of a gelf iltration experiment using 
5 partially purified GOSA.S as feedstock were 

tested for glucose oxidase and Streptococcus 
sanguis bispecific binding activity. 
Figure 19 shows the results of an ELISA. Individual 

fractions of a gelf iltration experiment using 
10 partially purified GOSA.T as feedstock were 

tested for glucose oxidase and Streptococcus 
sanguis bispecific binding activity. 
Figure 20 shows the source of fragment PCR.I BstEII/SacI 
Figure 21 shows the source of fragment PCR.II Sfil/EcoRI 
15 Figure 22 shows the source of fragment PCR.III Nhel/SacX 
Figure 23 shows the source of fragment PGR. IV Xhol/EcoRl 
Figure 24 shows the source of fragment PCR.V Sall/BcoRl 
Figure 25 shows the source of fragment PCR.VI Sfil/Nhel 
Figure 26 shows the source of fragment PCR.VI I 
20 BstEII/Nhel 

Figure 27 shows the source of fragment PCR.VIII 
JQjoI/EcoRI 

Figure 28 shows the source of fragment PCR.IX 
BstEII/Nhel 

25 Figure 29 shows the source of fragment PCR.X Pstl/EcoRl 
Figure 30 shows the construction of plasmid pGOSA.A 
Figure 31 shows the construction of plasmid pGOSA.B 
Figure 32 shows the construction of plasmid pGOSA.C 
Figure 33 shows the construction of plasmid pGOSA.D 

30 Figure "34 shows the construction of plasmid pGOSA.E 
Figure 35 shows the construction of plasmid pGOSA.V 
Figure 36 shows the construction of plasmid pGOSA.S 
Figure 37 shows the construction of plasmid pGOSA.T 
Figure 38 a/b shows the construction of plasmid pGOSA.G 

35 Figure 39 shows the construction of plasmid pGOSA.J 
Figure 40 shows the construction of plasmid pGOSA.Z 
Figure 41 shows the construction of plasmid pGOSA.AA 
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Figure 42 shows 

Figure 43 shows 

Figure 44 shows 

Figure 45 m shows 

5 Figure 46 shows 

Figure 47 shows 
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the construction of 
the construction of 
the construction of 
the construction of 
the construction of 
the construction of 



plasmid pGOSA.AB 
plasmid pGOSA.L 
plasmid pGOSA.Y 
plasmid pGOSA.X 
plasmid pGOSA.AC 
plasmid pGOSA.AD. 



Table 1 shows the nucleotide sequence of the 

oligonucleotides used to produce the constructs 
described in this specification. Restriction sites 
encoded by these primers are underlined. 



Table 2 gives an overview of all GOSA constructs described 
in this specification. 
15 Table 2A describes intermediate constructs that 

were not further tested. 

Table 2B describes the dicistronic constructs. 
Table 2C describes the monocistronic constructs. 

20 Detailed description of the invention 

In this specification the construction of an antibody 
fragment analogue consisting of a two chain protein complex 
is described, in which one of the chains consists of two 
heavy chain V-domains and the other chain consists of the 

25 two corresponding light chain V-domains in either order. 
The variable domains are linked either directly or through 
a polypeptide linker. Subsequent molecular modelling of 
this combination suggested that the protein chains could 
fold such that both binding sites are fully accessible, 

30 provided that the connecting linkers are kept long enough 
to span 30 to 35 A. 

Whereas in patent application WO 93/11161 it is explicitly 
described that for the above described bispecific complexes 
two flexible polypeptide linkers in the self asisembling 
35 complex are required, the present invention illustrated 
here describes in particular the construction of a two 
chain protein complex containing only one linker or no 
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linkers at all. The latter antibody fragment analogue thus 
consists of a two chain protein complex containing one 
polypeptide chain comprising heavy chain V- domains fused 
directly together and another polypeptide chain comprising 
5 the corresponding light chain V- domains fused together, 
both fusions in the absence of linkers. But also two chain 
protein complexes in which each chain comprises a linker 
between the two variable domains can be used as antibody 
fragment analogues according to the invention as described 
10 below with construct pGOSA.E. However, the two chain 

complexes containing only one linker or no linker at all 
are preferred. The abbreviation GOSA used in this 
specification relates to a combination of glucose oxidase 
and Streptococcus sanguis. 

15 

In this specification evidence is provided that these 
antibody fragment analogues ("double heads") contain both 
antigen binding specificities of the Fv's used to generate 
these bispecific antibody fragments. It is exemplified that 
20 these type of constructs according to the invention can be 
used to target the enzyme glucose oxidase to whole 
bacteria, using antibody fragments derived from hybridomas 
expressing antibodies directed against these antigens. 

25 The present invention is now described by reference to some 
specific examples, which are included for purposes of 
illustration only and are not intended to limit the scope 
of the invention. 

EXAMPLES 

General experimental 
Strains, Plasmids and Media 

All cloning steps were performed in E. coli JM109 (endAl, 
recAl, gyr A9 6 f thi, hsdR17(r K , m**) , relAl, supE44, A(lac- 
proAB) , [F' ( traD36, proAB, lacI q 2AM15] . E. coli cultures 
were grown in 2xTY medium (16 g tryptone, 10 g yeast 
extract, 5 g NaCl per litre HjO) , where indicated 



30 



35 
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supplemented with 2% glucose and/or 100 fig/ml ampicillin. 
Transformations were plated out on SOBAG plates (20 g 
tryptone, 5 g yeast extract, 15 g agar, 0.5 g NaCl per 
litre H 2 0 plus 10 mM MgCl 2 , 2% glucose, 100 ng/ml 
5 ampicillin) The expression vectors used are derivatives of 
pUC19. The oligonucleotide primers used in the PCR 
reactions were synthesized on an Applied Biosys terns 381A 
DMA Synthesiser by the phosphoramidite method. 

10 Expression of GOSA constructs 

Colonies from freshly transformed JM109 plated onto SOBAG 
plates were used to inoculate 2xTY medium supplemented with 
100 fig/ml ampicillin, 2% glucose. Cultures were shaken at 
37°C to an OD^ in the range of 0.5 to 1.0. Cells were 

15 pelleted by centrifugation and the supernatant was removed. 
The pelleted cells were resuspended in 2xTY medium with 100 
fiq/wl ampicillin, 1 mM IPTG, and grown for a further 18 
hours at 25°C. Cells were pelleted by centrifugation and 
the supernatant, containing the secreted chains, used 

20 directly in an EL ISA. The proteins in the periplasm of the 
pelleted cells were extracted by resuspending the cell 
pellet in 1/20 of the original culture volume of lysis 
buffer (20% sucrose, 200 rriM Tris-HCl pH 7.5, 1 mM EDTA, 500 
/xg/ml lysozyme) . After incubation at 25°C for 20 minutes an 

25 equal volume of H 2 0 was added and the incubation was 

continued for another 20 minutes. The suspension was spun 
at 10*000 g for 15 minutes and the supernatant containing 
the periplasmic proteins was used directly in an ELISA. 

30 ELISA 

96 well ELISA plates (Greiner HC plates) were activated 
overnight at 37°C with 200 fil/well of an 1/10 dilution of 
an over night culture of Streptococcus cells in 0.05 M 
sodium carbonate buffer at pH=9.5. Following one wash with 
35 PBST, the antigen sensitised plates were pre -blocked for 1 
hour at 37°C with 200 fil/w&ll blocking buffer (2% BSA, 
0.15% Tween in PBS). Samples containing 50 /il blocking 
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buffer plus 50 pi culture supernatants or periplasmic cell 
extracts (neat or diluted with PBS) were added to the 
Streptococcus sensitised plate and incubated for 2 hours at 
37°C. Following 4 washes with PBS-T, 100 fil of blocking 
5 buffer containing glucose oxidase (50 jig/ml) was added to 
every well. After incubation at 37°C for 1 hour unbound 
glucose oxidase was removed by 4 washes with PBS-T. Bound 
glucose oxidase was detected by adding 100 fil substrate to 
each well (70 mM Na-citrate, 320 mM Na-phosphate, 27 mg/ml 
10 glucose, 0.5 /xg/ml HRP, 100 ng/ml 1MB). The colour reaction 
was stopped after 1 hour by the addition of 35 fil 2 M HC1 
and the A450 was measured (compare Figures 11/15) . 

Affinity purification of GOSA antibody fragments u 

15 GOSA.E, GOSA.V, GOSA.S and GOSA.T were partially purified 
by affinity chromatography. 100 ml periplasmic extract of 
each of these constructs was loaded onto a Glucose- oxidase - 
Sepharose column (CNBr-Sepharose, Pharmacia) prepared 
according to the manufacturer's instructions. After 

20 extensive washes with PBS the bound GOSA antibody fragments 
were eluted in 0.1M glycine buffer at pH=2.8. The fractions 
were neutralised with Tris and analysed by polyacrylamide 
gel electrophoresis followed by silver staining and tested 
for the presence of double head activity. 

25 

EXAMPLE 1. Construction of the pGOSA double head 
expyegjgipn vwfcprp 

30 In this Example the construction of a two chain protein 
complex is described, in which one of the chains consists 
of two heavy chain V- domains and the other chain consists 
of the two corresponding light chain V- domains. The 
variable domains are linked either directly or through a 

35 polypeptide linker. The expression vectors used are 
derivatives of a pUC19 derived plasmid containing a 
Hindlll-EcoRI fragment that in the case of plasmid 
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scFv. 4715-myc contains a DNA fragment encoding one pelB 
signal sequence fused to the N- terminus of the V H that is 
directly linked to the corresponding V L of the antibody 
through a connecting flexible peptide linker, (Gly 4 Ser) 3 
5 (present in SEQ ID NO: 2 as amino acids 109-123 and in SEQ 
ID NO: 10 as amino acids 121-135), thus generating a 
single -chain molecule (see Figure 7) . 

In the dual -chain Fv and the pGOSA expression vectors, the 
DNA fragments encoding both the V H and V L of the antibody 

10 are preceded by a ribosome binding site and a DNA sequence 
encoding the pelB signal sequence in an artificial 
dicistronic operon under the control of a single inducible 
promoter (see Figures 5, 6 and 8) . Expression of these 
constructs is driven by the inducible lacZ promoter. The 

15 nucleotide sequence of the Hindlll-EcoRI inserts of the 
plasmids pUR.4124, Fv.3418, Fv. 4715-myc and scFv. 4715-myc 
constructs used for the generation of the bispecific 
antibody fragments are given in Figures 4-7, respectively. 
Moreover, a culture of E. coli cells harbouring plasmid 

20 scFv. 4715-myc and a culture of E. coli cells harbouring 

plasmid Fv.3418 were deposited under the Budapest Treaty at 
the National Collection of Type Cultures (Central Public 
Health Laboratory) in London (United Kingdom) with 
deposition numbers NCTC 12916 and NCTC 12915, respectively. 

25 In agreement with Rule 28 (4) EPC, or a similar arrangement 
for a State not being a Contracting State of the EPC, it is 
hereby requested that a sample of such deposit, when 
requested, will be submitted to an expert only. 
The construction of pGOSA.B (see Figure 8 for the Hindlll- 

30 EcoRI insert of pUC19) involved several cloning steps. The 
appropriate restriction sites in the various domains were 
introduced by PCR directed mutagenesis using the 
oligonucleotides listed in Table 1 below. The pGOSA.E 
derivatives pGOSA.V, pGOSA.S and pGOSA.T with only one or 

35 no linker sequence are derived from the pGOSA.E construct 
by removing the linker sequences by means of PCR directed 
mutagenesis with oligonucleotides listed in Table 1 below. 
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DBL.1 
DBL.2 

5 

DBL.3 

DBL.4 

10 DBL.S 

DBL.6 
DBL.7 

15 DBL.8 

DBL.9 



20 



DBL.10 



DBL.11 



DBL.12 

25 PCR.51 

PCR.89 

PCR.90 
30 PCR.116 
1=S/H,2 



Table 1. 

5' CAC CAT CTC CAG AGA CAA TGG CAA G-3' 

(=SEQ ID NO: 14) 
5'-GAG CGC GAG CTC GGC CGA ACC GGC C 'GA TCC GCC 
ACC GCC AGA GCC-3' (=SEQ ID NO: 15) 

5'-CAG GAT CC G GCC GGT TCG GCC ' CAG GTC CAG CTG 
CAA CAG TCA GGA-3' (=SEQ ID NO: 16) 

5'-CTA CAT GAA TTC 2 GCT AGC 3 TTA TTA TGA GGA GAC 
GGT GAC GGT GGT CCC TTG GC-3' (=SEQ ID NO: 17) 

5'-TAA TAA GCT AGC 3 GGA GCT GCA TGC AAA TTC TAT 
TTC-3* (= SEQ ID NO: 18) 

5'-ACC AAG CTC GAG 4 ATC AAA CGG GG-3'(= SEQ ID NO: 19) 
5 -AAT GTC GAA TTC 2 GTC GAC S TCC GCC ACC GCC AGA 
GCC-3* (= SEQ ID NO: 20) 

5 ' ATT GGA GTC GAC 5 ATC GAA CTC ACT CAG TCT CCA 
TTC TCC-3' (= SEQ ID NO: 21) 

5' TGA AGT GAA TTC 2 GCG_GCC_GC 6 T TAT TAC CGT TTG 
ATT TCG AGC TTG GTC CC-3' (= SEQ ID NO: 22) 

5'-CGA ATT C GG TCA CC g G TCT CCT CAC AGG TCC AGT 
TGC AAC AG-3* (= SEQ ID NO: 23) 

5' -CGA ATT CTC GAG 4 ATC AAA CGG GAC ATC GAA CTC 
ACT CAG TCT CC-3' (= SEQ ID NO: 24) 

5'-CGA ATT CGG TCA CC' G TCT CCT CAC AGG TGC AGT 
TGC AGG AG-3' (= SEQ ID NO: 25) 

5'-AGG T(C/G)(A/C) A(C/A)C_IgC_AG 7 (C/G) AGT C(A/T)G 
G-3' (= SEQ ID NO: 26) 

5'-TGA GGA GAC GGT GAC C 8 GT GGT CCC TTG GCC CC-3' 

(= SEQ ID NO: 27) 

5'^GAC ATT GAG CTC 9 ACC CAG TCT CCA-3{= SEQ ID NO: 28) 
5'-GTT AGA TCT CGA G 4 CT TGG TCC C-3' (= SEQ ID NO: 29) 
=EcoW, 3=Nhel, 4=Xhol, 5=SaH, 6=Notl, 7=Pstl, S=BstHl, 9= Sort 
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These three constructs lack some of the restriction sites 
at the new joining points. The V„A-V H B gene fragment without 
a linker lacks the 5' V H B Sfil site. The V L B-V L A gene 
fragment without a linker lacks the 5' V L A Sail site. 
5 The position of the oligonucleotides in the pGOSA 

constructs given in Table 1 are shown in Figure 9. The 
pGOSA expression vectors and the oligonucleotides in Table 
1 have been designed to enable most specificities to be 
cloned into the pGOSA constructs. Figure 10 shows the amino 
10 * acid sequence of the junctions between the VhA-V h B and V L B- 
V L A fragments encoded by DNA present in pGOSA.E, pGOSA.V, 
pGOSA.S and pGOSA.T. A more detailed description of the 
preparation of pGOSA.E, pGOSA.V, pGOSA.S and pGOSA.T is 
given in Example 5. 

15 

^ EXAMPLE 2. Bifunctional binding activity of 60SA double 
heads _ 

20 In this Example we provide evidence that the above 

described molecules ("double heads"), i.e. the two chain 
protein complexes, contain both antigen binding 
specificities of the Fv's used to generate these multi- 
functional antibody fragment analogues. Figure 12-15 show 

25 that GOSA.E, GOSA.V, GOSA.S and GOSA.T can be used to 

specifically target the enzyme glucose oxidase to several 
Streptococcus sanguis strains using antibody fragments 
derived from hybridoma's expressing antibodies directed 
against these antigens. 

30 Comparison of the binding specificity of the GOSA 

constructs (see Figures 12-15) and the binding specificity 
of the scFv.4715-myc (see Figure 11) shows that the fine 
specificity of the anti- Streptococcus sanguis scFv.4715 is 
preserved in the GOSA "double heads". 

35 
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EXAMPLE 3 . FPIiC analysis of GOSA double heads 

Partially purified GOSA.E, GOSA.V, GOSA.S and GOSA.T 
samples (estimated to be 50-80% pure by polyacrylamide gel 
5 electrophoresis) were analysed on a Pharmacia FPLC Superose 
12 column. The analysis was performed using PBS at a flow 
rate of 0.3 ml/minute. Eluate was monitored at 280 run and 
0.3 ml fractions were collected and analysed by ELISA. 
Usually GOSA.E, GOSA.V, GOSA.S and GOSA.T samples only gave 

10 one GOSA double head activity peak as determined by ELISA 
(see Figures 16-19),. The position of this peak in the 
elution pattern indicated that the molecular weight of the 
GOSA double head is 40-50 kD. Since this molecular weight 
corresponds to the expected molecular weight of the V H 2 + 

15 V L 2 double head dimer, the conclusion is justified that 

GOSA.E, GOSA.V, GOSA.S and GOSA.T are primarily produced as 
dimeric molecules. Occasionally an activity peak with an 
apparent molecular weight of ~200 kD was observed (see 
Figure 16) . The presence of Glucose Oxidase activity in 

20 these fractions (data not shown) indicate that these 

fractions contain GOSA double head complexed with glucose 
oxidase that was eluted with the GOSA sample from the 
glucose oxidase -sepharose affinity matrix. 

25 

EXAMPLE 4 t Production gf Ptber double heftfls 

The methods described in the previous Examples were used to 

produce other double heads, which also appeared to be 
30 active against the antigens for which they were developed. 

These other double heads had the following specificities: 

anti-S. sanguis / anti-beta-HCG, 

anti-S. sanguis / anti-urease, 

anti-S. sanguis / anti-hen-egg-lysozyme, 
35 anti-beta-HCG / anti-hen-egg-lysozyme, 

anti-hen-egg-lysozyme /anti-glucose oxidase, 

anti-huIgG / anti -glucose oxidase, 
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anti- urease / anti- glucose oxidase, 
anti-lacto-peroxidase / anti-glucose oxidase, 
anti- alpha -HCG / anti -glucose oxidase, and 
anti- reactive-Red- 6 / anti-glucose oxidase. 

5 

EXAMPLE 5* Detailed description of the preparation of 
intermediate constructs pGOSA.A, pGOSA.B 
pGOSA.C and pGOSA.D and their use for the 
!0 preparation of plasmid pGOSA.E and its 

derivati ves pGOSA.V. pGOSA.S and pGOSA.T 

Oligonucleotides and PCR 

The primary structures of the oligonucleotide primers used 
15 in the construction of the bispecific 'pGOSA' constructs 
are shown in Table 1 above. Reaction mixture used for 
amplification of DNA fragments were 10 mM Tris-HCl, pH 8.3 # 
2.5 mM MgCl 2 , 50 mM KC1, 0.01% gelatin (w/v) , 0.1% Triton X 
100, 400 mM of each dNTP, 5.0 units of Vent DNA polymerase 
20 (New England Biolabs) , 100 ng of template DNA, and 500 ng 
of each primer (for 100 pi reactions) . Reaction conditions 
were: 94°C for 4 minutes, followed by 33 cycles of each 1 
minute at 94 °C, 1 minute at 55 °C, and 1 minute 72 °C. 

25 Plasmid DNA\Vec tor \ Insert preparation and 
ligation\ transformation . 

Plasmid DNA was prepared using the 'Qiagen P-100 Midi-DNA 
Preparation' system. Vectors and inserts were prepared by 
digestion of 10 fig (for vector preparation) or 20 fig (for 

30 insert preparation) with the specified restriction 

endonucleases under appropriate conditions (buffers and 
temperatures as specified by suppliers) . Modification of 
the DNA ends with Klenow DNA polymerase and 
dephosphorylation with Calf Intestine Phosphorylase were 

35 performed according to the manufacturers instructions. 

Vector DNAs and inserts were separated through agarose gel 
electrophoresis and purified with DEAE -membranes NA45 
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(Schleicher & Schuell) as described by Maniatis et al. 
Ligations were performed in 20 fil volumes containing 30 mM 
Tris-HCl pH 7.8, 10 mM MgCl 2 , 10 mM DTT, 1 mM ATP, 300-400 
ng vector DNA, 100-200 ng insert DNA and 1 Weiss unit T 4 DNA 
5 ligase. After ligation for 2-4 h at room temperature, CaCl 2 
competent E. coli JM109 (Maniatis) were transformed using 
7.5 pi ligation reaction. The transformation mixtures were 
plated onto SOBAG plates and grown overnight at 37°C. 
Correct clones were identified by restriction analysis and 
10 verified by automated dideoxy sequencing (Applied 
Biosystems) . 

Restriction digestion of PCR products 

Following amplification each reaction was checked for the 
15 presence of a band of the appropriate size by agarose gel 
electrophoresis. One or two 100 /zl PCR reaction mixtures of 
each of the PCR reactions PCR. I - PCR.X (Figure 20-29), 
together containing approximately 2-4 /xg DNA product were 
subjected to phenol -chloroform extraction, chloroform 
20 extraction and ethanol precipitation. The DNA pellets were 
washed twice with 70% ethanol and allowed to dry. Next, the 
PCR products were digested overnight (18 h) in the presence 
of excess restriction enzyme in the following mixes at the 
specified temperatures and volumes. 
25 PCH t I: 

50 mM Tris-HCl pH 8.0, 10 mM MgCl 2 , 50 mM NaCl, 4 mM 
spermidine, 0.4 fig /ml BSA, 4 /il (* 40 U) Sad, 4 /*1 
(» 40 U) BstEII, in 100 ^1 total volume at 37°C. 
PCR. II : 

30 10 mM Tris-Acetate pH 7.5, 10 mM MgAc 2 , 50 mM KAc (lx 

"One-Phor-All" buffer ex Pharmacia) , 4 /zl (= 48 U) Sfil, in 
50 fil total volume at 50°C under mineral oil. After 
overnight digestion, PCR.II-Sfil was digested with BcoRl 
(overnight at 37°C) by the addition of 16 jzl H 2 0 , 30 fil lOx 

35 "One-Phor-All" buffer (Pharmacia) (100 mM Tris-Acetate pH 
7.5, 100 mM MgAc 2 , 500 mM KAc) and 4 fil (- 40 U) EcoRI. 
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PCR.III: 

10 nrtM Tris-Acetate pH 7.5, 10 mM MgAc 2 , 50 iriM KAc (lx 
"One-Phor-All" buffer {Pharmacia}), 4 /xl (= 40 U) Nhel, 
4 fil (» 40 U) Sad, in 100 /il total volume at 37 °C. 
5 PCR.IV : 

20 mM Tris-Acetate pH 7.5, 20 mM MgAc 2 , 100 mM KAc (2x 
"One-Phor-All w buffer {Pharmacia}), 4/il ( = 40 U) Xhol, 4 fil 
(= 40 U) EcoRl, in 100 /il total volume at 37°C. 
PCR.V: 

10 20 mM Tris-Acetate pH 7.5, 20 mM MgAc 2 , 100 ntM KAc (2x 
"One-Phor-All n buffer {Pharmacia}), 4 /xl (= 40 U) Sail, 
4 fil (= 40 U) ficoRI, in 100 /xl total volume at 37°C. 
PCR.VI : 

10 mM Tris-Acetate pH 7.5, 10 mM MgAc 2 , 50 mM KAc {lx 
15 "One-Phor-All" buffer {Pharmacia}), 4 /xl {= 48 U) Sfil, in 
50 /xl total volume at 50°C under mineral oil. After 
overnight digestion, PCR.VI- Sfil was digested with Miel 
(overnight at 37°C) by the addition of 41 /il H 2 0, 5 /xl lOx 
"One-Phor-All" buffer (Pharmacia) (100 mM Tris-Acetate pH 
20 7.5, 100 mM MgAc 2 , 500 mM KAc) and 4 /xl (= 40 U) Nhel. 
PCR.VIT : 

50 mM Tris-HCl, pH 8.0, 10 mM MgCl 2 , 50 mM NaCl, 4 mM 
spermidine, 0.4 fiq/wl BSA, 4 /xl (= 40 U) Nhel, 4 fil 
(= 40 U) BstEII, in 100 /xl total volume at 37°C. 
25 PCR.VIIT : 

20 mM Tris-Acetate pH 7.5, 20 mM MgAc 2 , 100 mM KAc (2x 
H One-Phor-All w buffer {Pharmacia}), 4 /xl (= 40 U) BcoRI, in 
50 /il total volume at 37°C. After overnight digestion, 
PCR.VIIl-BcoRI was digested with Xhol (overnight at 37°) by 
30 the addition of 46 /il HjO and 4 fil (= 40 U) Xhol. 
PCR.IX : 

25 mM Tris- Acetate, pH 7.8, 100 IriM KAc, 10 mM MgAc, UrtM DTT 
(lx "Multi-Core" buffer {Promega}), 4 mM spermidine, 0.4 
M9/nil BSA, 4 /xl ( = 40 U) Nhel, 4 fil (= 40 U) BstEII, in 100 
35 /il total volume at 37°C. 
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P C Rt X : 

50 mM Tris-HCl, pH 8.0, 10 mM MgCl 2/ 50 irtM NaCl, 4 mM 
spermidine, 0.4 /xg/ml BSA, 4 /zl (= 40 U) Ps tl, 4 /il 
(= 40 U) EcoRI, in 100 fil total volume at 37°C. 

5 

The digested PCR fragments 

P CR . I - Sa cl / B s tE 1 1 , P CR . 1 1 - Sfi I / £coR I , 

PCR.III-Jffitel/SacI, PCR . IV - Xhol / EcoRl , 

PCR . V- Sall/EcoRl , PCR .VI - Sfll/NheX , 

10 PCR.VII-BstEII/Afcel, PCR . VII I - Xhol / EcoRT , 

PCR . IX - Ba tETT/NheT , and PCR . X - Ps tl/ EcoRT 

were purified on an 1.2% agarose gel using DEAE- membranes 
NA45 (Schleicher & Schuell) as described by Mania t is et al. 
The purified fragments were dissolved in H 2 0 at a 

15 concentration of 100-150 ng/jil. 

_ Construction of the pGOSA double head expression vectors. 

The construction of pGOSA.E (see Figure 8) involved several 
cloning steps that produced 4 intermediate constructs 

20 pGOSA.A to pGOSA.D (see Figure 30-34) . The final expression 
vector pGOSA.E and the oligonucleotides in Table 1 above 
have been designed to enable most specificities to be 
cloned into the final pGOSA.E construct (Figure 9) . The 
upstream V H domain can be replaced by any Pstl-BstEII V H 

25 gene fragment obtained with oligonucleotides PCR. 51 and 

PCR. 89 (see Table 1 above). The oligonucleotides DBL.3 and 
DBL.4 (see Table 1 above) were designed to introduce Sfil 
and Nhel restriction sites in the V H gene fragments thus 
allowing cloning of those V H gene fragments into the Sfil- 

30 Nhel sites as the downstream V H domain. All V L gene 

fragments obtained with oligonucleotides PCR. 116 and PCR. 90 
(see Table 1 above) can be cloned into the position of the 
V L .34l8 gene fragment as a Sad -Xhol fragment. A 
complication here however is the presence of an internal 

35 SacI site in the V H .3418 gene fragment. Oligonucleotides 
DBL.8 and DBL.9 (see Table 1 above) are designed to allow 
cloning of V L gene fragments into the position of the 
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V L .4715 gene fragment as a Sall-NotI fragment. The pGOSA.E 
derivatives pGOSA.V, pGOSA.S and pGOSA.T with only one or 
no linker sequences contain some aberrant restriction sites 
at the new joining points. The V„A-V H B construct without a 
5 linker lacks the 5' V H B Sfil site. The V H B fragment is 
cloned into these constructs as a BstEII/Mhel fragment 
using oligonucleotides DBL.10 or DBL.ll and DBL.4 (see 
Table 1 above) . The V L B-V L A const ruct wit hout a linker lacks 
the 5' V L A Sail si"t^.~~Tfie~iygr fragment is cloned i nto these 
10 constructs as a Xhol/EcoRl fragment us in g olig onucleotides 
DBL.ll and DBL.9 (see Table -l above) J 

In the following part of the description the following 
linkers are mentioned which are also present in the 

15 sequence listing: 

the (Gly 4 Ser) 3 linker, present in SEQ ID NO: 2 as amino 
acids 109-123 and SBQ ID NO: 10 as amino acids 121-135, 
the (Gly 4 Ser) jAlaGlySerAla linker (= linkerA), present in 
SEQ ID NO: 12 as amino acids 121-139, and 

20 the (Gly 4 Ser) 2 Gly 4 Val linker (= linkerV) , present in SEQ ID 
NO: 13 as amino acids 108-122. 

pGOSA.A 

This plasmid is derived from both the Pv.4715-myc construct 

25 and the scFv.4715-myc construct. 

An Sfil restriction site was introduced between the DNA 
sequence encoding the (Gly 4 Ser) 3 linker and the gene 
fragment encoding the V L of the scFv.47l5-myc construct (see 
Figure 30) . This was achieved by replacing the BstEIl-SacI 

30 fragment of the latter construct by the fragment PGR- 1 

BstEII/SacI (Figure 20) that contains an Sfil site between 
the DNA encoding the (Gly 4 Ser) 3 linker and the V L .4715 gene 
fragment. The introduction of the Sfil site also introduced 
4 additional amino acids (AlaGlySerAla) between the 

35 (Gly 4 Ser) 3 linker and V L .4715 resulting in a 
(Gly 4 Ser)3AlaGlySerAla linker (linkerA). The 
oligonucleotides used to produce PCR-I (DBL.l and DBL.2, 
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see Table l above) were designed to match the sequence of 
the framework- 3 region of V H .4715 and to prime at the 
junction of the DNA encoding the (Gly 4 Ser) 3 linker and the 
V L .47l5 gene fragment, respectively. Thus pGOSA.A can be 
5 indicated as: peIB-V H 4715-linkerA- (Sfil) -V L 4715-myc. 

pGOSA.B 

This plasmid is derived from plasmid Fv.3418 (see Figure 
31). The XhoI-EcoRI fragment of plasmid Fv.3418 comprising 
10 the 3' end of DNA encoding framework- 4 of the V L including 
the stop codon was removed and replaced by the fragment 
PCR-IV XhoZ/EcoRl (Figure 23) . The oligonucleotides used to 
produce PCR-IV (DDL . 6 and DBL.7, see Table 1 above) were 
designed to match the sequence at the junction of the V L and 
15 the (Gly 4 Ser) 3 linker perfectly (DBL.6) , and to be able to 
prime at the junction of the (Gly 4 Ser) 3 linker and the V H in 
pUR.4124 (DBL.7). DBL.7 removed the PstI site in the V H 
(silent mutation) and introduced a Sail restriction site at 
the junction of the (Gly 4 Ser) 3 linker and the V H , thereby 
20 replacing the last Ser of the linker by a Val residue 
resulting in a (Gly 4 Ser) 2 Gly 4 Val linker (linkerV) . Thus 
pGOSA.B can be indicated as: 

pe2B-V H 3418 + peIB-V L 3418-linkerV- (Sall-KcoRI) 

pGOSA.C 

This plasmid contains DNA encoding V H .47l5 linked by the 
(Gly 4 Ser)3AlaGlySerAla linker to V H .3418 (see Figure 32), 
thus : pelB-V H 4715- linkerA-V M 3418 . 

This construct was obtained by replacing the Sfil-BcoRI 
fragment from pGOSA.A encoding V L .47l5 by the fragment PCR- 
II Sfil/EcdRl containing the V H .3418 gene (see Figure 21). 
The oligonucleotides used to produce PGR- II (DBL.3 and 
DBL . 4 , see Table 1 above) hybridize in the framework- 1 and 
framework- 4 region of the gene encoding V H .3418, 
respectively. DBL.3 was designed to remove the PstI 
restriction site (silent mutation) and to introduce an Sfil 
restriction site upstream of the V H gene. DBL. 4 destroys 



25 



30 
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the BstEII restriction site in the framework- 4 region and 
introduces an Nhel restriction site downstream of the 
stopcodon. 

5 pGOSA.D 

This plasmid contains a dicistronic operon comprising the 
V„.34l8 gene and DNA encoding V L .3418 linked by the 
(Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 (see Figure 33), thus: 
peIB-V H 3418 + pe2B-V L 3418-linkerV-V L 4715. 

10 This construct was obtained by digesting plasmid pGOSA.B 

with Sall-EcoRI and inserting the fragment PCR-V SalX/EcoRI 
(Figure 24) containing the V L .4715 gene. The 
oligonucleotides used to obtain PCR-V (DBL.8 and DBL.9, see 
Table 1 above) were designed to match the nucleotide 

15 sequence of the framework- 1 and framework -4 regions of the 
V L .4715 gene, respectively. DBL.8 removed the SacI site from 
the framework- 1 region (silent mutation) and introduced a 
Sail restriction site upstream of the V L .4715 gene. DBL.9 
destroyed the Xhol restriction site in the framework-4 

20 region of the V L .47l5 gene (silent mutation) and introduced 
a ItotI and an EcoRI restriction site downstream of the stop 
codon . 

pGOSA.E 

25 This plasmid contains a dicistronic operon comprising DNA 
encoding V H .47l5 linked by the (Gly 4 Ser)3AlaGlySerAla linker 
to V„.3418 plus DNA encoding V L .3418 linked by the 
(Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 (see Figure 34), thus: 
pelB-V H 4715-linkerA-V H 3418 + pelB-V L 3418-linkerV-V L 4715 . 

30 Both translational units are preceded by a ribosome binding 
site and DNA encoding a pelB leader sequence. This plasmid 
was obtained by a three-point ligation by mixing the vector 
resulting from pGOSA.D after removal of the V H 3418 -encoding 
Pstl-Sacl insert with the Pstl-fltoel pGOSA.C insert 

35 containing V H .4715 linked to V H .3418 and the PCR-III 

Whel/SacI fragment (see Figure 22) . The remaining Pstl-SacI 
pGOSA.D vector contains the 5' end of the framework- 1 
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region of V H .3418 upto the Patl restriction site and V L .3418 
linked by the (Gly 4 Ser) 2 Gly 4 Val linker to V L .4715 starting 
from the Sad restriction site in V L .3418. The Pstl-Miel 
pGOSA.C insert contains V H .4715 linked by the (Gly 4 Ser) 3 - 
5 AlaGlySerAla linker to V H .3418 # starting from the PstI 

restriction site in the framework- 1 region in V H .4715. The 
Nhel-SacI PCR-III fragment provides the ribosome binding 
site and DNA encoding the pelB leader sequence for the 
V L . 3418 - (Gly 4 Ser) 2 Gly 4 Val-V L . 4715 construct. The oligo- 
10 nucleotides DBL.5 and PCR.116 (see Table 1 above) used to 
generate PCR-III were designed to match the sequence 
upstream of the ribosome binding site of V L .47l5 in Fv.4715 
and to introduce an Nhel restriction site (DBL.5), and to 
match the framework-4 region of V L .3418 (PCR.116). 

15 

pGOSA.V 

This plasmid is derived from pGOSA.E (see Figure 35) from 
which the BstEII/Whel fragment containing DNA encoding 
linkerA-V H .34l8 was excised and replaced by the fragment 

20 PCR-VII BstEXX/Nhel containing the V H .3418 gene (see Figure 
26). The resulting plasmid pGOSA.V contains V H .3418 linked 
directly to the framework-4 region of V H .47l5, plus V L .4715 
linked by the (Gly 4 Ser) 2 Gly 4 Val linker to the framework-4 
region of V L .3418, thus: 

25 pelB-V H 4715*V H 3418 + pelB-V L 3418-linkerV-V L 4715 

pGOSA.S 

This plasmid is derived from pGOSA.E (see Figure 36) from 
which the (Gly 4 Ser)2Gly 4 Val-V L 4715 Xhol/EcoRI fragment was 

30 excised and replaced by the fragment PCR-VIII Xhol/EcoRI 
which contains V L .4715 (see Figure 27) . The resulting 
plasmid pGOSA.S contains V H .4715 linked by the (Gly 4 Ser) 3 - 
AlaGlySerAla linker to V H .3418 plus V L .3418 linked directly 
to the 5' end of the framework- 1 region of V L .4715, thus: 

35 pelB-V H .4715-linkerA-V H .3418 + pelB-V L .3418*V L .4715 . 
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pGOSA.T 

This plasmid contains a dicistronic operon consisting of 
V H .3418 directly to the framework- 4 region of V H .4715 plus 
V L .3418 linked directly to the 5' end of the framework-l 
region of V L .47l5 (see Figure 37) . Both transcriptional 
units are preceded by a ribosome binding site and a pelB 
leader sequence, thus: 

jpeIB-V H .4715*V„.3418 + pelB- V, .3418»V T .47J3 *. 
This construct was obtained by inserting the Nhel/EcoRl 
fragment of pGOSA.S which contains V L .3418 linked directly 
to the 5' end of the framework- 1 region of V L .47l5, into the 
vector pGOSA.V from which the Nhel/EcoRl fragment 
containing V L . 3418 linked by the (Gly 4 Ser) 2 Gly 4 Val linker to 
V L .4715 was removed. 



EXAMPLE 6. Detailed description of the preparation of 

other dicistronic constructs pGOSA.G, and 
— PGQSA.J. pGOSA.Z, pGOSA.AA and pGQSA.AB 

pGOSA.G 

This plasmid is an intermediate for the synthesis of 
pGOSA.J. It is derived from pGOSA.E from which the V H 4715 
Pstl/BstEII fragment has been excised and replaced by the 
V H 3418 Pstl/BstEII fragment (excised from Pv.3418). The 
resulting plasmid pGOSA.G (see Figure 38) contains two 
copies of V H .3418 linked by the (Gly^erJ^AlaGlySerAla 
linker, plus V L .4715 linked by the (Gly 4 Ser)2Gly 4 Val linker 
to the framework- 4 region of V L .3418, thus: 

pelB-V H .3418-linkerA-V H .3418 + pelB-V L . 3418 -linkerV-V L . 4715 . 
pGOSA.J 

This plasmid contains a dicistronic operon consisting of 
V H .3418 linked by the (Gly 4 Ser ) jAlaGlySerAla linker to 
V H .4715 plus V L .3418 linked by the (Gly 4 Ser) jGly^al linker 
to V L .4715. Both transcriptional units are preceded by a 
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ribosome binding site and a pelB leader sequence (see 
Figure 39) , thus: 

peIB-V B .3418-linkerA-V H .4715 + pelB-V L . 3418 -linkerV-V L . 4715 . 

5 This construct was obtained by inserting the fragment PCR- 
VI Sfil/Nhel which contains V H 4715 (Figure 25) , into the 
vector pGOSA.G from which the Sfil/Nhel V H 3418 fragment was 
removed. 



10 pGOSA.Z 

This plasmid is derived from pGOSA.G from which the 
(Gly^erJjAlaGlySerAla linker-V H 3418 BstElZ/Nhel fragment 
was excised and replaced by the fragment PCR-IX BstEXl/Nhel 
which contains V H .4715 (Figure 28) . The resulting plasmid 
15 pGOSA.Z (see Figure 40) contains V H .3418 linked directly to 
the framework- 1 region of V H .4715, plus V L .4715 linked by 
the (Gly 4 Ser) 2 Gly 4 Val linker to the framework- 4 region of 
V L .3418, thus: 

peIB-V H ,3418*V H .4715 + pelB-V L . 3418 -linkerV-V L . 4715 • 

20 

pGOSA*AA 

This plasmid contains a dicistronic operon consisting of 
the V H .3418 linked directly to the 5' end of the framework - 
1 region of V H .4715 plus V L .3418 linked directly to the 5' 

25 end of the framework- 1 region of V L .47l5. Both 

transcriptional units are preceded by a ribosome binding 
site and a pelB leader sequence (see Figure 41) . This 
construct was obtained by inserting the Nhel/BcoKl fragment 
of pGOSA.T which contains V L .3418 linked directly to the 5' 

30 end of the framework- 1 region of V L .4715, into the vector 
pGOSA.Z from which the Nhel/EcoRI fragment containing 
V L .3418 linked by the (Gly 4 Ser)2Gly 4 Val linker to V L *4715 was 
removed, thus: 

pe2B-V H .3418*V H .4715 + pelB-V L .3418*V L .4715 . 



W097A14719 



PCT/EP96/036O5 



31 

pGOSA.AB 

This plasmid is derived from pGOSA.J by a three point 
ligation reaction (see Figure 42). The SacI/EcoRI insert, 
containing part of V H .3418 and the full 
5 (Gly 4 Ser) 3 AlaGlySerAla linker- V„. 4715 and the V L .3418- 

(Gly 4 Ser) 2 Gly 4 Val-V L .4715 encoding sequences, was removed and 
replaced by the Sad/ Sad pGOSA. J fragment containing the 
same part of V H .3418 and the full (Gly 4 Ser) aAlaGlySerAla 
linker- V H . 4715 and the Sad/EcoRI pGOSA.T fragment 

10 containing V L .3418 linked directly to the framework- 1 region 
of V L .47l5 (see Figure 37). The resulting plasmid contains 
V H .3418 linked by the (Gly 4 Ser) > AlaGlySerAla linker to the 
5' end of the framework- 1 region of V H .4715 plus V L .3418 
linked directly to the 5' end of the framework- 1 region of 

15 V L .4715, thus: 

peIB-V H .3418-linkerA-V H .4715 + peIB-V L .3418*V L .4715 . 

EXAMPLE 7. Detailed description of the preparation of 
20 monocistronic constructs pGOSA.L and pGOSA.Y, 
and pGOSA.C. pGOSA.X. pGQSA.AC and pGOSA.AD 



pGOSA.L 

This plasmid is derived from pGOSA.E from which the 
25 ttindlll/Miel fragment containing DNA encoding V H .4715- 

(Gly 4 Ser)3AlaGlySerAla-V H .3418 was removed (see Figure 43) . 
The DNA ends of the vector were made blunt -end using Klenow 
DNA polymerase and ligated. The resulting plasmid pGOSA.L 
contains V L .3418 linked by the (Gly 4 Ser)2Gly 4 Val linker to 
30 the 5 # end of the framework- 1 region of V L .47l5, thus: 

pelB-V L .3418-linkerV-V L .4715. 

pGOSA.T 

This plasmid is derived from pGOSA.T from which the 
35 Hindlll/Nhel fragment containing DNA encoding V H .4715- 
V H .3418 was removed (see Figure 44) . The DNA ends of the 
vector were made blunt -end using Klenow DNA polymerase and 
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ligated. The resulting plasmid pGOSA.Y contains V L .3418 
linked directly to 5' end of the framework-l region of 
V L .4715, thus: 

pelB-V L .3418*V L .4715. 

5 

The preparation of pGOSA.C was given in Example 5 above; it 
can be indicated with: pelB- V H . 4715 -linkerA-V H . 3418 . 

pGOSA.X 

10 This plasmid is derived from pGOSA.T from which the 

NheX/EcoRI fragment containing DNA encoding V L . 3418-V L .4715 
was removed. The DNA ends of the vector were made blunt -end 
using Klenow DNA polymerase and ligated. The resulting 
plasmid pGOSA.X (see Figure 45) contains V H .4715 linked 

15 directly to 5' end of the framework-l region of V H .3418, 
thus : pelB-V H . 4715*V H . 3418 . 

pGOSA.AC 

This plasmid is derived from pGOSA.Z from which the 
20 Nhel/EcoRI fragment containing DNA encoding V L .3418- 

(Gly 4 Ser) 2 Gly 4 Val-V L ,4715 was removed (see Figure 46) . The 
DNA ends of the vector were made blunt -end using Klenow DNA 
polymerase and ligated. The resulting plasmid pGOSA.AC 
contains V H .3418 linked directly to 5' end of the 
25 framework-l region of V H .4715, thus: 

pelB-V H ^3418*V H .4715. 

pGOSA.AD 

This plasmid was obtained by inserting the Pstl/EcoRI PCR.X 
30 fragment containing DNA encoding V H . 3418- (G^SerJjAlaGly- 
SerAla-V H .4715 (see Figure 29) into the Fv.4715-myc vector 
from which the Pstl/BcoRl Fv.47l5-myc insert was removed 
(see Figure 47) , thus: peIB-V H . 3418 -linkerA-V B . 4715 . 

35 These monocistronic constructs can be used to transform the 
same host with two different plasmids or to transform two 
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different hosts, so tiiat the two V H 's in series can be 
produced separately from the two V L 's in series. 



Evaluation of the results obtained 
5 Bifunctional binding activity of GOSA double heads 

In this specification the construction of a two chain 
protein complex is described, in which one of the chains 
consists of two heavy chain V- domains and the other chain 
consists of the two corresponding light chain V-domains. 

10 The variable domains are linked either directly or through 
a polypeptide 1 inker . In this specification evidence is 
provided that these type of molecules ("double heads") 
contain both antigen binding specificities of the Pv's used 
to generate these multi-functional antibody fragments. 

15 Figure 12 shows that GOSA.E can be used to specifically 

target the enzyme glucose oxidase to several Streptococcus 
sanguis strains, using antibody fragments derived from 
hybridomas expressing antibodies directed against these 
antigens. Figure 12 further shows that the fine specificity 

20 of the ant i- Streptococcus sanguis scFv 4715 is preserved in 
the GOSA.E double head. 



Effect of linkers and relative position of V-domains on 
double head activity 

25 After it was shown that the "cross -over double -head" 
approach (V„A-V H B + V L B-V L A) yields active bispecif ic 
molecules, the importance of the relative position of the 
V-domains in these constructs was investigated. Both 
possible positional orientations (GOSA.E = V„A-LinkerA-V H B + 

30 V L B-LinkerV-V L A and GOSA. J - V H B-LinkerA-V„A + V L B-LinkerV- 
V L A) were constructed and tested for bispecif ic activity, 
despite the suggestion obtained by molecular modelling that 
the binding site formed by the second (downstream/C- 
terminal) V-domains in the configuration V H B-V„A + V L B-V L A 

35 (GOSA. J) was in an unfavourable position for binding to 
large protein antigens on the surface of cells. 
Surprisingly however, it was found experimentally that the 
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downstream binding site is in fact accessible. Although the 
relative position of the heavy chains and the light chains 
was found to have an effect on the observed reactivity both 
tested combinations show bispecific activity with the 
5 11 cross -over" combination (GOSA.E = V„A-V H B + V L B-V L A) 

exhibiting a higher level of reactivity compared to the 
combination V H B-V„A + V L B-V L A (= GOSA.J) as demonstrated for 
A=anti -Strep and B«anti-Gox. 

10 Molecular modelling of the V H B-V„A + V L B-V L A (= GOSA.J) 
configuration further suggested that , only when the 
connecting linkers are kept long enough (to span 30 to 
35 A) , the protein chains could fold such that both binding 
sites are fully accessible. 

15 The "cross -over" configuration: VhA-V h B + V L B-V L A (GOSA.E) 
wherein linker length was not critical, was predicted to 
result in a complex with both binding sites facing in 
opposite directions, without the restraints suggested for 
the configuration V h B-VhA + V L B-V L A (GOSA.J) . 

20 Removing the flexible polypeptide linker from the V„A-V M B 
chain only had a minimal effect on the ability of the 
double head in the "cross -over" configuration (GOSA.V = 
V„A*V H B + V L B-V t A) to bind both S. sanguis and Glucose 
oxidase. However, removing the flexible polypeptide linker 

25 from the V H B-V„A chain from the molecule in the V H B-V„A + 
V L B-V L A configuration (GOSA.Z = V H B*V„A + V L B-V L A ) resulted 
in a dramatic reduction of its ability to bind both S. 
sanguis and Glucose oxidase. 

In contrast with the double head in the "cross -over" 
30 configuration without the flexible polypeptide linker 
between the two heavy chain domains (GOSA.V) , where 
molecular modelling predicted the resulting molecule to be 
active, removal of the flexible linker from the V L B-V L A 
chain could not be modelled such that both binding sites 
35 were fully accessible. ELISA results confirm that the 

double head in the V H B-V„A + V L B-V L A configuration without a 
linker between the two light chain domains (GOSA.AB) 
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exhibits only minimal S. sanguis and glucose oxidase 
binding activity. Surprisingly, deletion of the flexible 
linker from the V L B-V L A chain from the double head in the 
"cross -over" configuration (GOSA.S) only had a small effect 
5 on the bispecific activity of the resulting molecule. 

As expected from the molecular modelling results from the 
double heads without a flexible linker between the two 
light chain domains, removal of both the flexible 
polypeptide linkers from the double head molecules, could 

10 not be modelled such that both binding sites were fully 
accessible. In agreement with the ELISA results obtained 
with the GOSA.AB construct, the double head in the V H B-V„A + 
V L B-V L A configuration without any linkers (GOSA.AA) only 
exhibits minimal if any S. sanguis and glucose oxidase 

15 binding activity. Surprisingly, the double head in the 
"cross-over" configuration without any linkers (GOSA.T = 
V„A*V H B + V L B*V L A) still exhibited 25-50% of S. sanguis and 
glucose oxidase bispecific binding activity when compared 
to the activity of the double head in the "cross-over" 

20 configuration with two linkers {GOSA.E) . 

Thus the conclusion of this work is that modelling can give 
some indications, but that the computer programmes cannot 
predict what is possible and what not. Several deviations 
from the modelling expectations were found. With a 

25 paraphrase on an old saying: theories are nice but the 
experiment is the ultimate proof. 

Sensitivity of GOSA double heads 

Using an ELISA format it was shown that the sensitivity of 
30 the GOSA.E double head is as least as a sensitive as an 

IgG-glucose oxidase conjugate, as determined by the lowest 
concentration of Streptococcus sanguis antigen immobilised 
on a solid phase that is still detectable. 

35 GOSA double heads are produced as dimers 

PPLC analysis of partially affinity-purified GOSA.E, 
GOSA.V, GOSA.S and GOSA.T samples usually gave only one 
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GOSA double head activity peak as determined by ELISA 
(Figures 16-19). The position of this peak in the elution 
pattern indicated that the molecular weight of the GOSA 
double head is 40-50 kD. Since this molecular weight 
5 corresponds to the expected molecular weight of the V H 2 + 
V L 2 double head dimer, it was concluded that GOSA.E, GOSA.V, 
GOSA.S and GOSA.T are primarily produced as dimeric 
molecules. Occasionally an activity peak with an apparent 
molecular weight of ~200 kD was observed (Figure 16) . The 
10 presence of glucose oxidase activity in these fractions 
indicate that these fractions contain GOSA double head 
complexed with glucose oxidase. 

In vitro assembly of GOSA double heads 

15 It was shown that bi functionally active dimeric GOSA 

molecules together in one cell can be produced by transla- 
tion from one dicistronic messenger (GOSA.E, GOSA.S, 
GOSA.T, GOSA.V, GOSA. J, GOSA.AB, GOSA. AA and GOSA.Z) . In 
addition high levels of S. sanguis and glucose oxidase 

20 bispecific binding activity is formed when supernatant s of 
cultures producing the separate GOSA subunits are mixed 
(see Example 7) . The effects of linkers and the relative 
position of the individual V H -domains on the S. sanguis and 
glucose oxidase bispecific binding activity observed in 

25 these mixing experiments are comparable to the dicistronic 
constructs. 

The constructs described above are summarised in Table 2 
below . 

30 Table 2A describes intermediate constructs that were not 
further tested. 

Table 2B describes the dicistronic constructs. 
Table 2C describes the monocistronic constructs. 

(Li A) stands for the V H -V H linker (Gly 4 Ser);>AlaGlySerAla 
35 (= linkerA) 

(LiV) stands for the V L -V L linker (Gly 4 Ser) 2 Gly 4 Val 
(= linkerV) 
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(*) indicates that the two heavy chain domains or the 
two light chain domains are fused together without 
a connecting linker. 



Table 2. 



Table 2A 
60S A. A 
10 GOSA.B 
60SA.D 
GOSA.G 



V H .4715-LiA- (Sfil) -V L .4715-myc 
V H .3418-LiV-V L .3418- (Sall/BcdRl) 
V H .3418 + V L . 3418 -LiV-V L . 4715 
V„.3418-LiA-V H .3418 + V u . 3418-LiV- V L .4715 



Table 2B 

15 GOSA.E 
GOSA.S 
GOSA.T 
GOSA.V 



V H .4715-LiA-V H .3418 + V L . 3418 -LiV-V L . 4715 
V„.4715-LiA-V H .3418 + V L . 3418*V L . 4715 
V H .4715*V H .3418 + V L .3418*V L .4715 
V H .4715*V„.3418 + V L . 3418 -LiV-V L . 4715 



20 



GOSA.J 
GOSA.AB 
GOSA.AA 
GOSA.Z 



V H .3418-LiA-V H .4715 + V L . 3418-LiV-V L .4715 
V H .3418-LiA-V H .4715 + V L . 3418*V L .4715 
V H .3418*V H .4715 + V L .3418*V L .4715 
V H .3418*V H .4715 + V L .3418-LiV-V L .4715 



25 Table 2C 

GOSA.L : 
GOSA.Y : 



V L . 3418 - LiV- V L .4715 
V L .3418*V L .4715 



30 



GOSA.AD 
GOSA.AC 
GOSA.C 
GOSA.X 



V H . 3418 -LiA-V H . 4715 

V„.3418*V H .4715 

V H . 4715 - LiA- V„ . 3418 

V H .4715*V H .3418 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Unilever PLC 

(B) STREET: Unilever House, Blackfriars 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : EC4P 4BQ (GB) 

(A) NAME: Unilever N.V. 

(B) STREET: Weena 455 

(C) CITY: Rotterdam 

(E) COUNTRY: The Netherlands 

(F) POSTAL CODE (ZIP) : NL-3013 AL 

(A) NAME: Paul James DAVIS 

(B) STREET: The Hawthorns, Pavenham Road 

(C) CITY: Felmersham (Bedfordshire) 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : MK43 7 EX (GB) 

(A) NAME: Come lis Paul Erik van der LOGT 

(B) STREET: 1 Bluebell Rise (Peverel Manor Estate) 

(C) CITY: Rushden (Northamptonshire) 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : NN10 OTU (GB) 

(A) NA ME: M artine Elisa VERHOEIJEN 

(B) STREET: 1 Tintagel Close (Manor Farm Estate) 

(C) CITY: Rushden (Northamptonshire) 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : NN10 ONP (GB) 

(A) NAME: Steve Wilson 

(B) STREET: 3 Aldenham Close (Goldington ) 

(C) CITY: Bedford, 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : MK41 OFQ (GB) 

(ii) TITLE OF INVENTION: A bifunctional or bivalent 

antibody fragment analogue 

(iii) NUMBER OF SEQUENCES: 31 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC* DOS /MS* DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(v) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: BP 95307332.7 



(2) INFORMATION FOR SBQ ID NO: 1: 

(i) SEQUEN CE CH ARACTERISTICS: 

(A) LENGTH: 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "cDNA domains with synthetic linker (s) " 
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(vii) IMMEDIATE SOURCE: 

(B) CLONE: EcoRI-Hindlll insert of pUR4124 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 11. .730 

<D) OTHER INFORMATION: /product = "VLlys-GS-VHlys" 

( ix) FEATURE : 

(A) NAME/KEY: xnat_peptide 

(B) LOCATION: 11. .334 

(D) OTHER INFORMATION: /products "VLlys" 

(ix) FEATURE: 

(A) NAME /KEY: misc_RNA 

(B) LOCATION: 335. .379 

(D) OTHER INFORMATION: /product = " <Gly4Ser) 3 linker" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION:380. .727 

(D) OTHER INFORMATION : / product = "VHlys" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAATTCGGCC GAC ATC GAG CTC ACC CAG TCT CCA GCC TCC CTT TCT GCG 49 
Asp lie Glu Leu Thr Gin Ser Pro Ala Ser Leu Ser Ala 
1 5 10 

TCT GTG GGA GAA ACT GTC ACC ATC ACA TGT CGA GCA AGT GGG AAT ATT 97 
Ser Val Gly Glu Thr Val Thr lie Thr Cys Arg Ala Ser Gly Asn lie 
15 20 25 

CAC AAT TAT TTA GCA TGG TAT CAG CAG AAA CAG GGA AAA TCT CCT CAG 145 
His Asn Tyr Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin 
30 35 40 45 

CTC CTG GTC TAT TAT ACA ACA ACC TTA GCA GAT GGT GTG CCA TCA AGG 193 
Leu Leu Val Tyr Tyr Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Arg 
50 55 60 

TTC AGT GGC AGT GGA TCA GGA ACA CAA TAT TCT CTC AAG ATC AAC AGC 241 
Phe Ser Gly Ser Gly Ser Gly Thr Gin Tyr Ser Leu Lys lie Asn Ser 
65 70 75 

CTG CAA CCT GAA GAT TTT GGG AGT TAT TAC TGT CAA CAT TTT TGG AGT 289 
Leu Gin Pro Glu Asp Phe Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser 
80 85 90 

ACT CCT CGG ACG TTC GGT GGA GGG ACC AAG CTC GAG ATC AAA CGG GGT 337 
Thr Pro Arg Thr Phe Gly Gly Gly Thr Lys Leu Glu lie Lys Arg Gly 
95 100 105 

GGA GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC GGA TCG CAG GTG 385 
Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gin Val 
110 115 120 125 

CAG CTG CAG GAG TCA GGA CCT GGC CTG GTG GCG CCC TCA CAG AGC CTG 433 
Gin Leu Gin Glu Ser Gly Pro Gly Leu Val Ala Pro Ser Gin Ser lieu 
130 135 140 

TCC ATC ACA TGC ACC GTC TCA GGG TTC TCA TTA ACC GGC TAT GGT GTA 481 
Ser He Thr Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly Val 
145 150 155 
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AAC TGG GTT CGC CAG CCT CCA GGA AAG GGT CTG GAG TGG CTG GGA ATG 529 
Asn Trp Val Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Met 
160 165 170 

ATT TGG GGT GAT GGA AAC ACA GAC TAT AAT TCA GCT CTC AAA TCC AGA 577 
He Trp Gly Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Ser Arg 
175 180 185 

CTG AGC ATC AGC AAG GAC AAC TCC AAG AGC CAA GTT TTC TTA AAA ATG 625 
Leu Ser He Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu Lys Met 
190 195 200 205 

AAC AGT CTG CAC ACT GAT GAC ACA GCC AGG TAC TAC TGT GCC AGA GAG 673 
Asn Ser Leu His Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala Arg Glu 
210 215 220 

AGA GAT TAT AGG CTT GAC TAC TGG GGC CAA GGG ACC ACG GTC ACC GTC 721 
Arg Asp Tyr Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val 
225 230 235 

TCC TCA TGA TAAGCTT 737 
Ser Ser * 
240 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 240 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Asp lie Glu Leu Thr Gin Ser Pro Ala Ser Leu Ser Ala Ser Val Gly 
1 5 10 15 

Glu Thr Val Thr lie Thr Cys Arg Ala Ser Gly Asn lie His Asn Tyr 
20 25 30 

Leu Ala Trp Tyr Gin Gin Lys Gin Gly Lys Ser Pro Gin Leu Leu Val 
35 40 45 

Tyr Tyr Thr Thr Thr Leu Ala Asp Gly Val Pro Ser Arg Phe Ser Gly 
50 55 60 

Ser Gly Ser Gly Thr Gin Tyr Ser Leu Lys He Asn Ser Leu Gin Pro 
65 70 75 80 

Glu Asp Phe Gly Ser Tyr Tyr Cys Gin His Phe Trp Ser Thr Pro Arg 
85 90 95 

Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys Arg Gly Gly Gly Gly 
100 105 no 

Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gin Val Gin Leu Gin 
115 120 125 

Glu Ser Gly Pro Gly Leu Val Ala Pro Ser Gin Ser Leu Ser He Thr 
130 135 140 

Cys Thr Val Ser Gly Phe Ser Leu Thr Gly Tyr Gly Val Asn Trp Val 
145 150 155 160 
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Arg Gin Pro Pro Gly Lys Gly Leu Glu Trp Leu Gly Met lie Trp Gly 
165 170 175 

Asp Gly Asn Thr Asp Tyr Asn Ser Ala Leu Lys Ser Arg Leu Ser lie 
180 185 190 

Ser Lys Asp Asn Ser Lys Ser Gin Val Phe Leu Lys Met Asn Ser Leu 
195 200 205 

His Thr Asp Asp Thr Ala Arg Tyr Tyr Cys Ala Arg Glu Arg Asp Tyr 
210 215 220 

Arg Leu Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser * 
225 230 235 240 



{2) INFORMATION FOR SEQ ID HO: 3: 

(i) SEQUENCE CH ARACTERISTICS: 

(A) LENGTH: 920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "cDNA domains with synthetic linker (s) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindi I I -EcoRI insert Fv.3418 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 36. .443 

(D) OTHER INFORMATION: /products »pelB-VH3418" 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 36. .101 

(D) OTHER INFORMATION: /products "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 102. .440 

(D) OTHER INFORMATION: /products "VH3418* 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 495 . „ 884 

(D) OTHER INFORMATION: /products •pelfi-VI^lS" 

(ix) FEATURE: 

(A) NAME/KEY: sig_jpeptide 

(B) LOCATION: 49 5. .560 

(D) OTHER INFORMATION: /products "pectate lyase " 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 561. .881 

(D) OTHER INFORMATION: /products "VL3418" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAGCTTGCAA ATTCTATTTC AAGGAGACAG TCATA ATG AAA TAC CTA TTG CCT 53 

Met Lys Tyr Leu Leu Pro 
-22 -20 
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ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG GCC X01 
Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala 
-15 -10 -5 

CAG GTG CAG CTG CAG CAG TCA GGA CCT GAG CTG GTA AAG CCT GGG GCT 149 
Gin Val Gin Leu Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly Ala 
1 5 io is 

TCA GTG AAG ATG TCC TGC AAG GCT TCT GGA TAC ACA TTC ACT AGC TAT 197 
Ser Val Lys Met Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr 
20 25 30 

GIT ATG CAC TGG GTG AAA CAG AAG CCT GGG CAG GGC CTT GAG TGG ATT 245 
Val Met His Trp Val Lys Gin Lys Pro Gly Gin Gly Leu Glu Trp He 
35 40 45 

GGA TAT ATT TAT CCT TAC AAT GAT GGT ACT AAG TAC AAT GAG AAG TTC 293 
Gly Tyr He Tyr Pro Tyr Asn Asp Gly Thr Lys Tyr Asn Glu Lys Phe 
50 55 60 

AAA GGC AAG GCC ACA CTG ACT TCA GAC AAA TCC TCC AGC ACA GCC TAC 341 
Lys Gly Lys Ala Ttxx Leu Thr Ser Asp Lys Ser Ser Ser Thr Ala Tyr 
65 70 75 80 

ATG GAG CTC AGC AGC CTG ACC TCT GAG GAC TCT GCG GTC TAT TAC TGT 389 
Met Glu Leu Ser Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cvs 
85 90 95 

TCA AGA CGC TTT GAC TAC TGG GGC CAA GGG ACC ACG GTC ACC GTC TCC 437 
Ser Arg Arg Phe Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser 
100 105 no 

TCA TAA TAAGAGCTAT GGGAGCTTGC ATGCAAATTC TATTTCAAGG AGACAGTCAT 493 
Ser * 

A ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC 539 
Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu 
-22 -20 -is -io 

GCT GCC CAA CCA GCG ATG GCC GAC ATC GAG CTC ACC CAG TCT CCA TCT 587 
Ala Ala Gin Pro Ala Met Ala Asp He Glu Leu Thr Gin Ser Pro Ser 
5 15 

TCC ATG TAT GCA TCT CTA GGA GAG AGA ATC ACT ATC ACT TGC AAG GCG 635 
Ser Met Tyr Ala Ser Leu Gly Glu Arg He Thr He Thr Cys Lys Ala 
10 15 20 25 

AGT CAG GAC ATT AAT ACC TAT TTA ACC TGG TTC CAG CAG AAA CCA GGG 
Ser Gin Asp He Asn Thr Tyr Leu Thr Trp Phe Gin Gin Lys Pro Glv 
30 35 40 

AAA TCT CCC AAG ACC CTG ATC TAT CGT GCA AAC AGA TTG CTA GAT GGG 731 
Lys Ser Pro Lys Thr Leu He Tyr Arg Ala Asn Arg Leu Leu Asp Gly 
« 50 55 

GTC CCA TCA AGG TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTC 
Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Gin Asp Tyr Ser Leu 
60 65 70 

ACC ATC AGC AGC CTG GAC TAT GAA GAT ATG GGA ATT TAT TAT TGT CTA 827 
Thr He Ser Ser Leu Asp Tyr Glu Asp Met Gly He Tyr Tyr Cys Leu 
75 80 85 

CAA TAT GAT GAG TTG TAC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC 875 

I* ^ ^ Glu Leu Tyr Thr phe Gl y G1 y g1 y ' T * ir L y s giu ne 

90 95 100 105 



683 



779 
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AAA CGG TAA TAATGATCAA ACGGTATAAG GATCCAGCTC GAATTC 920 
Lys Arg * 



(2) INFORMATION FOR SBQ ID NO: 4: 

<i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 136 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 : 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Gin Ser Gly Pro Glu 
-5 15 10 

Leu Val Lys Pro Gly Ala Ser Val Lys Met Ser Cys Lys Ala Ser Gly 
15 20 25 

Tyr Thr Phe Thr Ser Tyr Val Met His Trp Val Lys Gin Lys Pro Gly 
30 35 40 

Gin Gly Leu Glu Trp lie Gly Tyr lie Tyr Pro Tyr Asn Asp Gly Thr 
45 50 55 

Lys Tyr Asn Glu Lys Phe Lys Gly Lys Ala Thr Leu Thr Ser Asp Lys 
60 65 70 

Ser Ser Ser Thr Ala Tyr Met Glu Leu Ser Ser Leu Thr Ser Glu Asp 
75 80 85 90 

Ser Ala Val Tyr Tyr Cys Ser Arg Arg Phe Asp Tyr Trp Gly Gin Gly 
95 100 105 

Thr Thr Val Thr Val Ser Ser * 
110 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CH ARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

Ala Gin Pro Ala Met Ala Asp lie Glu Leu Thr Gin Ser Pro Ser Ser 
-5 1 5 10 

Met Tyr Ala Ser Leu Gly Glu Arg lie Thr lie Thr Cys Lys Ala Ser 
15 20 25 

Gin Asp lie Asn Thr Tyr Leu Thr Trp Phe Gin Gin Lys Pro Gly Lys 
30 35 40 
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Ser Pro Lys Thr Leu He Tyr Arg Ala Asn Arg Leu Leu Asp Gly Val 
45 50 55 

Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly Gin Asp Tyr Ser Leu Thr 
60 65 70 

He Ser Ser Leu Asp Tyr Glu Asp Met Gly He Tyr Tyr Cys Leu Gin 
75 80 85 90 

Tyr Asp Glu Leu Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
95 100 105 

Arg * 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) TYPE : nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "cDNA domains with synthetic linker (s) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll -EcoRI insert of Fv.4715-myc 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 40. .468 

(D) OTHER INFORMATION: /products tt pelB-VH4715 M 

(ix) FEATURE: 

(A) NAME/KEY: sig_peptide 

(B) LOCATION: 40. .105 

<D) OTHER INFORMATION: /products "pectate lyase" 

(ix) FEATURE: 

(A) NAME /KEY: mat_peptide 

(B) LOCATION: 106. .465 

(D) OTHER INFORMATION: /product = 0 VH4715" 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 520. .963 

(D) OTHER INFORMATION: /products "pelB- VL4715 -myc* 1 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 520. .585 

(D) OTHER INFORMATION: /products "pectate lyase" 

(ix) FEATURE: 

(A) NAME/KEY: raat_peptide 

(B) LOCATION: 586. .927 

(D) OTHER INFORMATION: /products "VL4715" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 928. .960 

(D) OTHER INFORMATION: /products "myc- tag" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAG CTA TTG 54 

Met Lys Tyr Leu Leu 
-22 -20 

CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 102 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
«15 -10 -5 

GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 150 
Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lys Pro Gly 
15 10 15 

GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 198 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 
20 25 30 

TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 246 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lys Ser Leu Glu Trp 
35 40 45 

GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 294 
Val Ala Thr He Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Asn 
50 55 60 

GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 342 
Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn Gly Lys Asn Thr Leu 
65 70 75 

TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 390 
Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 
80 85 90 95 

TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 438 
Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 110 

CAA GGG ACC ACG GTC ACC GTC TCC TCA TAA TAAGAGCTAT GGGAGCTTGC 488 
Gin Gly Thr Thr Val Thr Val Ser Ser * 
115 120 

ATGCAAATTC TATTTCAAGG AGACAGTCAT A ATG AAA TAC CTA TTG CCT ACG 540 

Met Lys Tyr Leu Leu Pro Thr 
-22 -20 

GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG GCC GAC 588 
Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met Ala Asp 
-15 -10 -5 1 

ATC GAG CTC ACT CAG TCT CCA TTC TCC CTG ACT GTG ACA GCA GGA GAG 636 
He Glu Leu Thr Gin Ser Pro Phe Ser Leu Thr Val Thr Ala Gly Glu 
5 10 15 

AAG GTC ACT ATG AAT TGC AAG TCC GGT CAG AGT CTG TTA AAC AGT GTA 684 
Lys Val Thr Met Asn Cys Lys Ser Gly Gin Ser Leu Leu Asn Ser Val 
20 25 30 

AAT CAG AGG AAC TAC TTG ACC TGG TAC CAG CAG AAG CCA GGG CAG CCT 732 
Asn Gin Arg Asn Tyr Leu Thr Trp Tyr Gin Gin Lys Pro Gly Gin Pro 
35 40 45 

CCT AAA CTG TTG ATC TAC TGG GCA TCC ACT AGG GAA TCT GGA GTC CCT 780 
Pro Lys Leu Leu He Tyr Trp Ala Ser Thr Arg Glu Ser Gly Val Pro 
50 55 60 65 
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GAT CGC TTC ACA GCC AGT GGA TCT GGA ACA GAT TTC ACT CTC ACC ATC 828 

Asp Arg Phe Thr Ala Ser Gly Ser Gly Thr Asp Phe Thr Leu Thr He 
70 75 80 

AGC AGT GTG CAG GCT GAA GAC CTG GCA GTT TAT TAC TGT CAG AAT GAT 876 
Ser Ser Val Gin Ala Glu Asp Leu Ala Val Tyr Tyr Cys Gin Asn Asp 
85 90 95 

TAT ACT TAT CCG TTC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC AAA 924 
Tyr Thr Tyr Pro Phe Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys 
100 105 HO 

CGG GAA CAA AAA CTC ATC TCA GAA GAG GAT CTG AAT TAA TAAGATCAAA 973 
Arg Glu Gin Lys Leu He Ser Glu Glu Asp Leu Asn * 
"5 120 125 

CGGTAATAAG GATCCAGCTC GAATTC 999 

(2) INFORMATION FOR SEQ ID NO: 7: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -is -io 

Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp 
" 5 1 5 io 

Leu Val Lys Pro Gly Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly 
15 20 25 

Phe Thr Phe Ser Ser Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp 
30 35 40 

Lys Ser Leu Glu Trp Val Ala Thr lie Ser Ser Thr Asp Thr Tyr Thr 
45 50 55 

Tyr Tyr Ser Asp Asn Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn 
60 65 70 

Gly Lys Asn Thr Leu Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp 
75 SO 85 90 

Thr Ala Val Tyr Tyr Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr 
95 100 105 

Phe Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser * 
HO 115 120 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUEN CE CH ARACTERISTICS: 

(A) LENGTH: 146 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
-22 -20 -15 -10 

Ala Gin Pro Ala Met Ala Asp He Glu Leu Thr Gin Ser Pro Phe Ser 
-5 1 5 io 

Leu Thr Val Thr Ala Gly Glu Lys Val Thr Met Asn Cys Lys Ser Gly 
15 20 25 

Gin Ser Leu Leu Asn Ser Val Asn Gin Arg Asn Tyr Leu Thr Trp Tyr 
30 35 40 

Gin Gin Lys Pro Gly Gin Pro Pro Lys Leu Leu He Tyr Trp Ala Ser 
45 50 55 

Thr Arg Glu Ser Gly Val Pro Asp Arg Phe Thr Ala Ser Gly Ser Gly 
60 65 70 

Thr Asp Phe Thr Leu Thr He Ser Ser Val Gin Ala Glu Asp Leu Ala 
75 80 85 90 

Val Tyr Tyr Cys Gin Asn Asp Tyr Thr Tyr Pro Phe Thr Phe Gly Gly 
95 100 105 

Gly Thr Lys Leu Glu He Lys Arg Glu Gin Lys Leu He Ser Glu Glu 
110 us 120 

Asp Leu Asn * 

125 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "cDNA domains with synthetic linker (s)" 

<vii) IMMEDIATE SOURCE: 

(B) CLONE: Hindlll-EcoRI insert of scFv .4715-myc 

(ix) FEATURE: 

(A) NAME /KEY: sig_peptide 

(B) LOCATION: 40. .105 

(D) OTHER INFORMATION: /products: "pectate lyase" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 106 . .465 

(D) OTHER INFORMATION: /product = "VH4715" 

<ix) FEATURE: 

(A) NAME /KEY: misc_RNA 

(B) LOCATION: 466. .510 

(D) OTHER INFORMATION: /products • (Gly4Ser) 3 - linker" 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 511. -852 

(D) OTHER INFORMATION : /product = "VL4715" 
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(ix) FEATURE : 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 853. .885 

(D) OTHER INFORMATION: /product= M myc-tag" 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:40. .888 

(D) OTHER INFORMATION :/product= 

"pelB-VH4715- (Gly4Ser)3-VL4715-myc" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 54 

Met Lys Tyr Leu Leu 

-22 -20 

CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 102 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
-15 -10 -5 

GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 150 
Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lys Pro Gly 
1 5 10 15 

GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 198 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 
20 25 30 

TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 246 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lys Ser Leu Glu Trp 
35 40 45 

GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 294 
Val Ala Thr lie Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Asn 
50 55 60 

GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 342 
Val Lys Gly Arg Phe Thr He Ser Arg Asp Asn Gly Lys Asn Thr Leu 
65 70 75 

TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 390 
Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 
80 85 90 95 

TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 438 
Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 110 

CAA GGG ACC ACG GTC ACC GTC TCC TCA GGT GGA GGC GGT TCA GGC GGA 486 
Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly 
115 120 125 

GGT GGC TCT GGC GGT GGC GGA TCG GAC ATC GAG CTC ACT CAG TCT CCA 534 
Gly Gly Ser Gly Gly Gly Gly Ser Asp He Glu Leu Thr Gin Ser Pro 
130 135 140 

TTC TCC CTG ACT GTG ACA GCA GGA GAG AAG GTC ACT ATG AAT TGC AAG 582 
Phe Ser Leu Thr Val Thr Ala Gly Glu Lys Val Thr Met Asn Cys Lys 
145 150 155 

TCC GGT CAG AGT CTG TTA AAC AGT GTA AAT CAG AGG AAC TAC TTG ACC 630 
Ser Gly Gin Ser Leu Leu Asn Ser Val Asn Gin Arg Asn Tyr Leu Thr 
"0 165 170 175 
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678 
726 
774 
822 
870 
924 



Trp Tyr Gin Gin ^ OCC ACT GGA 

— ffSSSSSS 5S = Sa '" 

w * s a U--K-sassss 

-r oo» *» <w S Si «» »• ** M »• 
S oiy «; w ™ »« ,„ ceo ttc ko «: 

s s s s s - - ~ - M - ^ 

SS o» »» — S5 

VOLKCOU TYPE- P£°"£ n S EQ id HO: 10'- 

^Me^aGlnValOln-OlnGluSe, V l0 

Ala Gin Pro Met i Ser oly 

' 5 Thr Ser Cys Ala Thr 

, .« Pro Gly Gly ser l*u Thr 

, m val W» G1 V 9X9 P 70 
_ Q( .r Asp ABO vaJ - * 

Tyr Tyr ser «v 65 Q1 Asp 

60 , ln «et Ser Ser M« ^ 8 Ser 90 

_ hr Leu Tyr Gln Met 85 
oiy ABn ™ ^ 80 Gly Tyr 

75 ^ B Ala Arg His Gly Tyr Tyr Gly W lQ5 

„. val Tyr Tyr Cys Ala ax 3 l00 

«« ser Pro «» Ser l*u Thr vai l50 
I,eu Thr Gin Ser l45 

t An 
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val Asn Gin 

Mn Ser leu 1*" Ser 170 
. ftcn cvb LY8 Ser Gly Gin Ser i6 5 
Thr Met ABn Cya 

« TVT Gin Gin !«• «o GlY Gin Pro Pg 

lell nw Thr Trp Tyr oxu i80 
Arg Asn Tyr ^ ^ ^ 

Ma Ser Thr »g GXu Ser Gly Val vt 
t*u Leu He Tyr Trp Ala Ser t 

190 Phe Thr Leu Thr He Ser Ser 

rw Ser Gly Thr Asp Phe Thr i* ^ 
Phe Thr Ala Ser Gly Ser o y M0 

205 ^ cvb Gin Asn Asp Tyr Thr 

, ^ a Glu Asp «u Ala val Tyr Tyr Cys 

Val Gin Ala Glu **P 225 

220 «v G1V Thr l*y 8 I- G1U 116 9 «0 

^ Pro Phe Thr Phe Gly Gly Gly Th ^ 

, 2) INFORMATION FOR SB0 - *> "° ! 

D TOPOIOGV: linear ,. nk , r (»)' 

,ixl FEATORK: t peptide 
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(ix) FEATURE: 

(A) NAME/KEY: sigjpeptide 
<B) LOCATION: 913. .978 

(D) OTHER INFORMATION : /product = "pectate lyase" 

( ix) FEATURE : 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 979. .1299 

(D) OTHER INFORMATION: /product = B VL3418 W 

{ ix) FEATURE : 

(A) NAME /KEY: misc_RNA 

(B) LOCATION: 1300. .1344 

(D) OTHER INFORMATION: /products "linker V 
(Gly4Ser) 2Gly4Val" 

(ix) FEATURE: 

(A) NAME /KEY: ntat_peptide 

(B) LOCATION: 1345. .1686 

(D) OTHER INFORMATION: /product = "VL4 715" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AAGCTTGCAT GGAAATTCTA TTTCAAGGAG ACAGTCATA ATG AAA TAC CTA TTG 54 

Met Lys Tyr Leu Leu 
-22 -20 

CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA GCG ATG 102 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
-15 -10 -5 

GCC CAG GTG CAG CTG CAG GAG TCA GGG GGA GAC TTA GTG AAG CCT GGA 150 
Ala Gin Val Gin Leu Gin Glu Ser Gly Gly Asp Leu Val Lys Pro Gly 
15 10 15 

GGG TCC CTG ACA CTC TCC TGT GCA ACC TCT GGA TTC ACT TTC AGT AGT 198 
Gly Ser Leu Thr Leu Ser Cys Ala Thr Ser Gly Phe Thr Phe Ser Ser 
20 25 30 

TAT GCC TTT TCT TGG GTC CGC CAG ACC TCA GAC AAG AGT CTG GAG TGG 246 
Tyr Ala Phe Ser Trp Val Arg Gin Thr Ser Asp Lys Ser Leu Glu Tip 
35 40 45 

GTC GCA ACC ATC AGT AGT ACT GAT ACT TAT ACC TAT TAT TCA GAC AAT 294 
Val Ala Thr lie Ser Ser Thr Asp Thr Tyr Thr Tyr Tyr Ser Asp Asn 
50 55 60 

GTG AAG GGG CGC TTC ACC ATC TCC AGA GAC AAT GGC AAG AAC ACC CTG 342 
Val Lys Gly Arg Phe Thr lie Ser Arg Asp Asn Gly Lys Asn Thr lieu 
65 70 75 

TAC CTG CAA ATG AGC AGT CTG AAG TCT GAG GAC ACA GCC GTG TAT TAC 390 
Tyr Leu Gin Met Ser Ser Leu Lys Ser Glu Asp Thr Ala Val Tyr Tyr 
80 85 90 95 

TGT GCA AGA CAT GGG TAC TAT GGT AAA GGC TAT TTT GAC TAC TGG GGC 438 
Cys Ala Arg His Gly Tyr Tyr Gly Lys Gly Tyr Phe Asp Tyr Trp Gly 
100 105 110 

CAA GGG ACC ACG GTC ACC GTC TCC TCA GGT GGA GGC GGT TCA GGC GGA 486 
Gin Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly 
115 120 125 

GGT GGC TCT GGC GGT GGC GGA TCG GCC GGT TCG GCC CAG GTC CAG CTG 534 
Gly Gly Ser Gly Gly Gly Gly Ser Ala Gly Ser Ala Gin Val Gin Leu 
130 135 140 
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CAA CAG TCA GGA CCT GAG CTG GTA AAG CCT GGG GCT TCA GTG AAG ATG 582 
Gin Gin Ser Gly Pro Glu Leu Val Lys Pro Gly Ala Ser Val Lys Met 
145 ISO 155 

TCC TGC AAG GCT TCT GGA TAC ACA TTC ACT AGC TAT GTT ATG CAC TGG 630 
Ser Cys Lys Ala Ser Gly Tyr Thr Phe Thr Ser Tyr Val Met His Trp 
160 165 170 175 

GTG AAA CAG AAG CCT GGG CAG GGC CTT GAG TGG ATT GGA TAT ATT TAT 678. 
Val Lys Gin Lys Pro Gly Gin Gly Leu Glu Trp lie Gly Tyr He Tyr 
180 185 190 

CCT TAC AAT GAT GGT ACT AAG TAC AAT GAG AAG TTC AAA GGC AAG GCC 726 
Pro Tyr Asn Asp Gly Thr Lys Tyr Asn Glu Lys Phe Lys Gly Lys Ala 
195 200 205 

ACA CTG ACT TCA GAC AAA TCC TCC AGC ACA GCC TAC ATG GAG CTC AGC 774 
Thr Leu Thr Ser Asp Lys Ser Ser Ser Thr Ala Tyr Met Glu Leu Ser 
210 215 220 

AGC CTG ACC TCT GAG GAC TCT GCG GTC TAT TAC TGT TCA AGA CGC TTT 822 
Ser Leu Thr Ser Glu Asp Ser Ala Val Tyr Tyr Cys Ser Arg Arg Phe 
225 230 235 

GAC TAC TGG GGC CAA GGG ACC ACC GTC ACC GTC TCC TCA TAA 864 
Asp Tyr Trp Gly Gin Gly Thr Thr Val Thr Val Ser Ser * 
240 245 250 

TAAGCTAGCG GAGCTGCATG CAAATTCTAT TTCAAGGAGA CAGTCATA ATG AAA TAC 921 

Met Lys Tyr 

-22 -20 

CTA TTG CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCT GCC CAA CCA 969 
Leu Leu Pro Thx Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro 
-15 -10 -5 

GCG ATG GCC GAC ATC GAG CTC ACC CAG TCT CCA TCT TCC ATG TAT GCA 1017 
Ala Met Ala Asp He Glu Leu Thr Gin Ser Pro Ser Ser Met Tyr Ala 
15 10 

TCT CTA GGA GAG AGA ATC ACT ATC ACT TGC AAG GCG AGT CAG GAC ATT 1065 
Ser Leu Gly Glu Arg He Thr He Thr Cys Lys Ala Ser Gin Asp lie 
15 20 25 

AAT ACC TAT TTA ACC TGG TTC CAG CAG AAA CCA GGG AAA TCT CCC AAG 1113 
Asn Thr Tyr Leu Thr Trp Phe Gin Gin Lys Pro Gly Lys Ser Pro Lys 
30 35 40 45 

ACC CTG ATC TAT CGT GCA AAC AGA TTG CTA GAT GGG GTC CCA TCA AGG 1161 
Thr Leu He Tyr Arg Ala Asn Arg Leu Leu Asp Gly Val Pro Ser Arg 
50 55 60 

TTC AGT GGC AGT GGA TCT GGG CAA GAT TAT TCT CTC ACC ATC AGC AGC 1209 
Phe Ser Gly Ser Gly Ser Gly Gin Asp Tyr Ser Leu Thr He Ser Ser 
65 70 75 

CTG GAC TAT GAA GAT ATG GGA ATT TAT TAT TGT CTA CAA TAT GAT GAG 1257 
Leu Asp Tyr Glu Asp Met Gly He Tyr Tyr Cys Leu Gin Tyr Asp Glu 
80 85 90 

TTG TAC ACG TTC GGA GGG GGG ACC AAG CTC GAG ATC AAA CGG GGT GGA 1305 
Leu Tyr Thr Phe Gly Gly Gly Thr Lys Leu Glu He Lys Arg Gly Gly 
95 100 105 

GGC GGT TCA GGC GGA GGT GGC TCT GGC GGT GGC GGA GTC GAC ATC GAA 1353 
Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Val Asp He Glu 
1X 0 115 120 125 
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"» Gln S 130 ^ CT A AAT CM 

„ tpc GOT CAG AST CTO ser val Asn Gin 

^ ATG tfn C^s i£ £ Sy cm Ser l*» " »* 

Thr Met Asn Cys W 150 CAG CCT CCT AAA 
n «» TAC CAG CAG AAG OCA GGG CAG ^ prQ ty8 

^GG AAC TAC TTG ACC TGG TAC ^ Ly8 170 

^ Asn Tyr Leu Tbr ^ 165 CCT gat CGC 

«-.-sssssssss ss — 

S; n. iy , ™ " no «jt 

1,5 <»S15SSS»»-SS 

Mr GGG ACC AAG CTC GAA M Rrg 

225 

TAAGCGGCCG CGAATTC 
ii INFORMATION FOR « 10 ^ 

IS TOPOLOGY: linear 

Met i*y B Tyr w» ^ u Pr -is G1 Asp 

val Ala thr He ser ser Thr ^ 

^ Mn val Lys Gly **9 *• Thr ^ 

Tyr Tyr Ser Asp Asn ^ Mp 
.vr ueu Gin Met ser ser ueu X-ys Ser ?0 

Giy «y ^ LeU 'so ^ Gly Lys Giy Tyr 

75 aia Arg Hie Gly Tyr Tyr Gly lQ5 

, „»i tvt Tyr Cys Ala m 100 
Thr Ala val Tyr ^ 



1401 
1449 
1497 
1545 
1593 
1641 
1689 
1706 
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n Glv Gly Ser Gly Q ly Gly ° Y 135 

wa «. v. « - - ffi - 01 ' " - w ^ s „ 

140 Gly Tyr Thr Phe Thr s 

va i Lys «et Ser Cys Lys Ala Ser 
W| Ser Val W l60 Glu Trp 

I 55 „ ln T.VS Pro Gly Gin Giy 

«.! Bet His Trp val Lys Gin Lys 

" ^.o.yr.sn.spGly--^--^ 
XleGlyTyrXleTyr-Tyr 1,5 erSetSerThrA la 

n a Thr Leu Thr Ser Asp Lys ser ^ 
PhetyB GlV^AlaThr 

S er Ser Leu Thr Ser Glu Asp Ser 
Met Glu Leu Ser Ser 

220 M Gly Gin Gly Thr Thr Val Thr ^ 

c r Ara Arg Phe Asp Tyr Trp Gly e 245 
Cys Ser Arg 240 
235 

Ser Ser * 

INFORMATION FOR SEQ ID »: ^ 



(2) 



sin 



Met Lys Tyr Leu Leu Pro *rw 

-22 " 20 lu ^ Thr Gin Ser 1£) 

»i» Met Ala Asp H e u 5 
Ala Gin Pro Ala Met x 

' 5 ne Thr Us Thr Cys Lys Aia 

_ Ala Ser Leu Gly Glu Arg He Thr 
Met Tyr Ala Ser Lys 
.^XleAsnThrTyrLeuThrTrp-GlnGlnLys „ 

^AsnArgLeu^uAspGlyVal 
Gly Ser Gly Gin Asp Tyr Ser Leu Thr 



S er Pro Lys Thr Leu He Tyr A|3 



Pto Ser Arg Phe Ser Gly Ser 

^ Glu Asp Met Gly He TYT Tyr cys Leu ^ 

ne Ser Ser Leu Asp Tyr Glu Asp 85 
15 * Phe Gly Gly Gly «* Lys Leu Glu IJ. 

. on Glu teu Tyr Thr Phe Giy 
Tyr ASP Glu LS ^ ^ 

o „ giv Giy Giy Giy ser Giy Giy y 
„ Giy Giy *y Giy Gly xx5 
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— — S*" 

140 -y,. Gin Gin ItfB l70 

, *™ Aisn Tyr Leu Thr TO Tyr ^ 

Val ABn Gin wg ^ Va l 
155 ^ tip Ala Ser Thr Arg Olu lfl5 

, **r G1Y Ser Gly Thr Asp 

205 „w riv Gly Thr l>ys Gxu 

Pro Phe thr Phe Gly Gly W ^ 0 
Tyr Thr Tyr Pro Phe 



220 

Lys «g * 

235 



(2) IHFOBMATION FOR SBQ 

■ s sagas--'- 

(D) TOPOLOGY. 

TfFR- other nucleic acx ic DNA . 

(xi) SKQOEHCE OBSCRXPTIOK: SBQ 
CAC CATC^ AGAGACAATG GCAAG 

g tSSSgy= 1^ 

GAGCGCGAGC TCGGCCGAAC CGGCCGA^ 
(2) INFORMATION FOR SBQ 10 »' l6: 
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a ^ cco=™*c cc^ cn^ 

CTACKTGAKT TCGCTAGCIT AX 
(2) x^TXON FOR SEQ XD »: ,8: 

c s&dbdsesS: single 
iS linear 

...» hOISCOI* TYPB: other .^^.^.tie «»• 

<ll) ^description-- /de8C 

M INFORMATION FOR SBQ *> l9: 

. § Safer- 
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l!tl) ^ -i— - SM » - 

tc) STOW" xinear 

jmicnoG*^ tcctcgrcw- cgccrccgcc 

(D) TOPOU)GY. J-* 

cocker c^rrcc 

(D) TOPOl^GY- axu 
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,r\ sTRftHDBDtlSSS: single 
^CDLE TYPE: other nucleic J^ cle DNA" 

~« QEO ID »0: 23: 

B) XHFOHM^OH «>* SEQ ID »: 26: 
Si SsdbdsbSS: single 

-oleCOI* TffK: other nucleic «£ t|-tlc DH A- 
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g) TOPOliOOT: linear 

cc 

„,———"""■"' 

orntiH tvpb: other nucleic acia DWV . 

GTTAGATCrC GAGCTTGGTC CC 
(2) XHKO^TXOH FOR SKQ ™ ** 



PCT/EF96/03605 

63 



(D ) TOPOLOGY: line« 

«« i ~ r s - ser 01 » 

1 

si 

^ „ «. I» «• ™ " 
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CLAIMS 



.„ride chains. »ner="i (v„) "» 

— a- »o ^ 
„ //v - A and V„-B//V L -B) • 

A//VL M claim i» xn 

body fragment analogue accar xn £eaae££s d 

'which the two V„ • linKer . 
without an intermediate 

■ hod v fragment analogue » cC *£f to each other 
4 . An antibody tr g direCtly connected 
which the two V L « s Unker . 

^ ^ an intermediate y r 
without an m ^ 3 or 

, » anti^ CS^^ £iISC 

^ 4. « -* ^ « v.. ana - „ 

polypi" 6 chau ' 

j IT 



tr — *- 

a second V u . 2> in 

^C^ide lit***' P 
a P P residue- 



a ^ I acid residue 
one amino acia 
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rv.-B), and xn /in K thus I v h" a h 

Iher by a peptide linker tW . ^^ sae n^ther_by_a 
other by J> ^ ^ ar ej*njiiJ^^^ 
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-A * LIl 
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to claim 2, 



.• h ndv fragment analogue acCor ^" 9 " t Vh (V „-A) 
^ a second V L (V,-B)w # ^ . 



tc antiDuu* — ^ 

- - -^cX* rr ' 

. , in i-ono*" 3 * 8 ""^ 3 .Ration -UK*. £ot 

analogue according to an, 

compiles incorporating 

(1) transforming a host Y ^ „ wltho ut 

a DBA coding the two V. 
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\>bv the connected V„ • 
whereby ww ^ fcw0 

double heao lle ctxng the a 

(4) optionally «- 
fragment analogue. ^ 

= for producing an whicn 
APr °Tn g to-yoneo f cla^^ 0 ' 

analogue according corp0 rating W that 

comprises t t hoS t by m _ ^ or 

t irst host a l» de link er, incorpor ating i^o 

without a connecting P ^ by in corp ^ w - tb 

- (2) transforming ^ encod . ng the t «o v L s 

tha t second hos- ^ ^ ^ 

or without a conne ^ second tr ^ 

(3) cultunng the ^ v B 

— C tvT^ctU, - ttfQ conne cted 
connected V L b. colleCti ng separately 

M optionally ed ^ conneCte d V L 's 

V - ^ tW ; g thT connected V bead antibody 

(5) combining th ^ f oEBl a d oub 

^aer conditions that 

fragment analogue- ^ ^ ^ the 

CO "' Materia. . ^"" the 9 e»«» 

lactic acW WCte Mlongl ag to the 9 „ la , 
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M K Y L L P T 
pelB AAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 

LEADER 



G L L L L A 



M A 



GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCISC^ 



GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCCTGTGCAACCTCT 



GGATTCACTTTCAGTAGTTATGCCTTTTCTTGG GTCCGCC AGACCTCAGACAAGAGTCTG 



VH4715 GAGTGGGTCGCAACCATCAGTAGTACTGATACTTATACCTATTATTCAGACAATGTGAAG 



GGGCGCTTCACCATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGCAAATGAGCAGT 



CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 



TATTTTGACTACTGGGGCCAAGGGACCACG3ICACCGTCTCCTCATAATAAGAGCTATGG 

M K Y L L P T 
GAGCTTGCATGCAAATTCTATTTCAAGGAGACAGTCATAATGAAATACCTATTGCCTACG 

LEADER 



AAAGLLLLAAQPAMA 
GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCGACATCGA2CTCACT 



Q S P 



S L .Tv/nV^V?;: A G E K V :^-:v:H-^N^r0^K;-V; S ! ' 



CAGTCTCCATTCTCCCTGACTGTGACAGCAGGAGAGAAGGTCACTATGAATTGCAAGTCC 



•• ;G-:-. Q S L L N : ...:-g^ypjKi^^Q R N Y : L -^Ipg^ 
GGTCAGAGTCTGTTAAACAGTGTAAATCAGAGGAACTACTTGACCTGGTACCAGCAGAAG 



VL4715 



CCAGGGCAGCCTCCTAAACTGTTGATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCT 

[:vB:p;v-R;v.. :: f t ;a - sm3^$MM^T- m-' p: : t;; ^gs^^ 

GATCGCTTCACAGCCAGTGGATCTCGAACAGATTTCACTCTCACCATCAGCAGTGTGCAG 



GCTGAAGACCTGGCAGTTTATTACTGTCAGAATGATTATACTTATCCGTTC 



Myc-tag \ g g;; ; :;t • .' k-^4»- B-n&^fe& ( eqkliseed l n) 

GGGGGGACCAAGCZO^ATCAAACGGGAACAAAAACTCATCTCAGAAGAGGATCTGAAT 



TAATAAGATCAAACGGTAATAAGGATCCAGCTCGAATTC 
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GCAGCCGCTGGATTGTTATTACTCGCTGCCCAACCAGCGATGGCCCAGGTGCAGCIGCAQ 



GAGTCAGGGGGAGACTTAGTGAAGCCTGGAGGGTCCCTGACACTCTCCTGTGCAACCTCT 



GGATTCACTTTCAGTAGTTATGCCTTTTCTTGGGTCCGCCAGACCTCAGACAAGAGTCTG 



VH471 5 GAGTGGGTC GC AACC ATCAGTAGTACTGATACTTATACCTATTATTCAGAC AATGTGAAG 
GGGCGC TTCAC CATCTCCAGAGACAATGGCAAGAACACCCTGTACCTGCAAATGAGCAGT 
CTGAAGTCTGAGGACACAGCCGTGTATTACTGTGCAAGACATGGGTACTATGGTAAAGGC 



milBiMP^I Ig G G G s 



LINKER 
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G G G G S G 



G G S |[ D vI-:--.^-0fc T: ; :| : ^Q:^^cP- •• F-v^l 



GGCGGAGGTGGCTCTGGCGGTGGCGGATCGGACATCGaSSZCCACTCAGTCTCCATTCTCC 



L T v T A G B •K ; L:;yH : T : ^M . -Kf .^G^.: : K- S::-G : feQ-,o:;S: L : p& ,1 
CTGACTGTGACAGCAGGAGAGAAGGTCACTATGAATTGCAAGTCCGGTCAGAGTCTGTTA 



VL4715 



AACAGTGTAAATCA3AGGAACTACTTGACCTGGTACCAGCAGAAGCCAGGGCAGCCTCCT 
AAACTGTTG A T CT A 2TG GGC ATCC ACTAGGGAATCTGG AGTC C CTG ATCGC TTCAC AGC C 



AGTGGATCTGGAACAGATTTCACTCTCACCATCAGCAGTGTGCAGGCTGAAGACCTGGCA 



GTTTATTACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGAGGGGGGACCAAGCTC 



Myc-tag I -E 1 i ; :K R } Cg Q_ 
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3> 



GA^TCAAACGGGAACAAAAACTCATCTCAGAAGAGGATCTGAATTAATAAGATCAAACG 



GTAATAAGGATCCAGCTCGAATTC 
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VH4715 
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:CAGACCTCAGACAAGAGTCTG 
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S G G G 
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LEADER GAGACAGTCATAATGAAATACCTATTGCCTACGGCAGCCGCTGGATTGTTATTACTCGCT 



A Q P A M A \ -jECife*^ 
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.L*^€ .E.- -.RVX : ::::.T, ;X4/&:^<fe^ 
VL34 1 8 CTAGGAGAGAGAATCACTATC ACTTGCAAGGCGAGTCAG^CATOAATACCTATTTAACC 



TGGTTCCAGCAGAAACCAGGGAAATCTCCCAAGACCCTGATCTATCGTGCAAACAGATTG 
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CTAGATGGGGTCCCATCAAGGTTCAGTGGCAGTGGATCTGGGCAAGATTATTCTCTCACC 



ATCAGCAGCCTGGACTATGAAGATATGGGAATTTATTATTGTCTACAATATGATGAGTTG 



Y T F G G G T^^.^ii • ; X. .. K R\\ G G G G S G G~ 
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TACACGTTCGGAGGGGGGACCAAGCTCGAGATCAAACGGGGTGGAGGCGGTTCAGGCGGA 



G G S G G G G~^NOSi:I:.v-g -.-L T • " Q'^i^^gB^S^M^i 
GGTGGCTCTGGCGGTGGC3GAGTCGACATCGAACTCACTCAGTCTCCATTCTCCCTGACT 



V ' T ' ■ . A G E v./: K K S GW&^¥ml^XMB 



GTGACAGCAGGAGAGAAGGTCACTATGAATTGCAAGTCCGGTCAGAGTCTGTTAAACAGT 



GTAAATCAGAGGAACTACTTGACCTGGTACCAGCAGAAGCCAGGGCAGCCTCCTAAACTG 
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T^GATCTACTGGGCATCCACTAGGGAATCTGGAGTCCCTGATCGCTTCACAGCCAGTGGA 
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TCTGGAACAGATTTCACTCTCACCATCAGCAGTGTGCAGGCTGAAGACCTGGCAGTTTAT 



Y C ; Q N D Y T^^:^:^^ ;, F .G 
TACTGTCAGAATGATTATACTTATCCGTTCACGTTCGGAGGGGGGACCAAGCTCGAAATC 
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AAACGGTAATAA GCGQCCG2 GAATTC 
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